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TITLE OF THE INVENTION 
SYNTHETIC HEPATITIS C GENES 

CROSS-REFERENCE TO RELATED APPLICATIONS 
5 Not applicable. 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not applicable. 

10 REFERENCE TO MICROFICHE APPENDIX 
Not applicable. 

FIELD OF THE INVENTION 
Not applicable. 

15 

BACKGROUND OF THE INVENTION 

This invention relates to novel nucleic acid pharmaceutical 
products, specifically nucleic acid vaccine products. The nucleic acid 
vaccine products, when introduced directly into muscle cells, induce the 
20 production of immune responses which specifically recognize Hepatitis 
C virus (HCV). 

Hepatitis C Virus 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease 

25 (or family of diseases) that is believed to be viral ly induced, and is 

distinguishable from other forms of virus-associated liver disease, such 
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr 
virus (EBV). Epidemiologic evidence suggests that there may be three 

30 types of NANBH: the water-borne epidemic type; the blood or needle 
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a 
new viral species, hepatitis C virus (HCV) has been identified as the 
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
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Hepatitis C appears to be the major form of transfusion-associated 
hepatitis in a number of countries, including the United States and 
Japan. There is also evidence implicating HCV in induction of 
hepatocellular carcinoma. Thus, a need exists for an effective method 
5 for preventing or treating HCV infection: currently, there is none. 

The HCV may be distantly related to the flaviviridae. The 
Flavivirus family contains a large number of viruses which are small, 
enveloped pathogens of man. The morphology and composition of 
Flavivirus particles are known, and are discussed in M. A. Brinton, in 

10 'The Viruses: The Togaviridae And Flaviviridae" (Series eds. Fraenkel- 
Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum 
Press, 1986), pp. 327-374. Generally, with respect to morphology, 
Flaviviruses contain a central nucleocapsid surrounded by a lipid 
bilayer. Virions are spherical and have a diameter of about 40-50 nm. 

15 Their cores are about 25-30 nm in diameter. Along the outer surface of 
the virion envelope are projections measuring about 5-10 nm in length 
with terminal knobs about 2 nm in diameter. Typical examples of the 
family include Yellow Fever virus, West Nile virus, and Dengue Fever 
virus. They possess positive-stranded RNA genomes (about 1 1,000 

20 nucleotides) that are slightly larger than that of HCV and encode a 
polyprotein precursor of about 3500 amino acids. Individual viral 
proteins are cleaved from this precursor polypeptide. 

The genome of HCV appears to be single-stranded RNA 
containing about 10,000 nucleotides. The genome is positive-stranded, 

25 and possesses a continuous translational open reading frame (ORF) that 
encodes a polyprotein of about 3,000 amino acids. In the ORF, the 
structural proteins appear to be encoded in approximately the first 
quarter of the N-terminal region, with the majority of the polyprotein 
attributed to non-structural proteins. When compared with all known 

30 viral sequences, small but significant co-linear homologies are observed 
with the nonstructural proteins of the Flavivirus family, and with the 
pestiviruses (which are now also considered to be part of the Flavivirus 
family). 
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Intramuscular inoculation of polynucleotide constructs, i.e., 
DNA plasrnids encoding proteins have been shown to result in the in situ 
generation of the protein in muscle cells. By using cDNA plasrnids 
encoding viral proteins, both antibody and CTL responses were 
5 generated, providing homologous and heterologous protection against 
subsequent challenge with either the homologous or cross-strain 
protection, respectively. Each of these types of immune responses 
offers a potential advantage over existing vaccination strategies. The 
use of PNVs (polynucleotide vaccines) to generate antibodies may result 

10 in an increased duration of the antibody responses as well as the 
provision of an antigen that can have both the exact sequence of the 
clinically circulating strain of virus as well as the proper post- 
radiational modifications and conformation of the native protein (vs. a 
recombinant protein). The generation of CTL responses by this means 

15 offers the benefits of cross-strain protection without the use of a live 
potentially pathogenic vector or attenuated virus. 

Therefore, this invention contemplates methods for 
introducing nucleic acids into living tissue to induce expression of 
proteins. The invention provides a method for introducing viral 

20 proteins into the antigen processing pathway to generate virus-specific 
immune responses including, but not limited to, CTLs. Thus, the need 
for specific therapeutic agents capable of eliciting desired prophylactic 
immune responses against viral pathogens is met for HCV virus by this 
invention. Of particular importance in this therapeutic approach is the 

25 ability to induce T-cell immune responses which can prevent infections 
even of virus strains which are heterologous to the strain from which 
the antigen gene was obtained. Therefore, this invention provides DNA 
constructs encoding viral proteins of the hepatitis C virus core, envelope 
(El), nonstructural (NS5) genes or any other HCV genes which encode 

30 products which generate specific immune responses including but not 
limited to CTLs. 
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DNA Vaccines 

Benvenisty, N., and Reshef, L. [PNAS 83, 9551-9555, 
(1986)] showed that CaCl2-precipitated DNA introduced into mice 

intraperitoneal ly (i.p.), intravenously (i.v.) or intramuscularly (i.m.) 
5 could be expressed. The i.m. injection of DNA expression vectors 
without CaCl2 treatment in mice resulted in the uptake of DNA by the 
muscle cells and expression of the protein encoded by the DNA . The 
plasmids were maintained episomally and did not replicate. 
Subsequently, persistent expression has been observed after i.m. 

10 injection in skeletal muscle of rats, fish and primates, and cardiac 
muscle of rats. The technique of using nucleic acids as therapeutic 
agents was reported in WO90/1 1092 (4 October 1990), in which 
polynucleotides were used to vaccinate vertebrates. 

It is not necessary for the success of the method that 

15 immunization be intramuscular. The introduction of gold 

microprojectiles coated with DNA encoding bovine growth hormone 
(BGH) into the skin of mice resulted in production of anti-BGH 
antibodies in the mice. A jet injector has been used to transfect skin, 
muscle, fat, and mammary tissues of living animals. Various methods 

20 for introducing nucleic acids have been reviewed. Intravenous injection 
of a DNAxationic liposome complex in mice was shown by Zhu et aL, 
[Science 261:209-21 1 (9 July 1993) to result in systemic expression of a 
cloned transgene. Ulmer et aL, [Science 259:1745-1749, (1993)] 
reported on the heterologous protection against influenza virus infection 

25 by intramuscular injection of DNA encoding influenza virus proteins. 

The need for specific therapeutic and prophylactic agents 
capable of eliciting desired immune responses against pathogens and 
tumor antigens is met by the instant invention. Of particular 
importance in this therapeutic approach is the ability to induce T-cell 

30 immune responses which can prevent infections or disease caused even 
by virus strains which are heterologous to the strain from which the 
antigen gene was obtained. This is of particular concern when dealing 
with HIV as this virus has been recognized to mutate rapidly and many 
virulent isolates have been identified [see, for example, LaRosa et al M 
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Science 249:932-935 (1990), identifying 245 separate HIV isolates]. In 
response to this recognized diversity, researchers have attempted to 
generate CTLs based on peptide immunization. Thus, Takahashi et al., 
[Science 255:333-336 (1992)] reported on the induction of broadly 
5 cross-reactive cytotoxic T cells recognizing an HIV envelope (gpl60) 
determinant. However, those workers recognized the difficulty in 
achieving a truly cross-reactive CTL response and suggested that there 
is a dichotomy between the priming or restimulation of T cells, which is 
very stringent, and the elicitation of effector function, including 

10 cytotoxicity, from already stimulated CTLs. 

Wang et al. reported on elicitation of immune responses in 
mice against HIV by intramuscular inoculation with a cloned, genomic 
(unspliced) HIV gene. However, the level of immune responses 
achieved in these studies was very low. In addition, the Wang et al., 

15 DNA construct utilized an essentially genomic piece of HIV encoding 
contiguous Tat//?£V-gpl60-Tat//?£V coding sequences. As is described 
in detail below, this is a suboptimal system for obtaining high-level 
expression of the gpl60. It also is potentially dangerous because 
expression of Tat contributes to the progression of Karposi's Sarcoma. 

20 WO 93/17706 describes a method for vaccinating an animal 

against a virus, wherein carrier particles were coated with a gene 
construct and the coated particles are accelerated into cells of an animal. 

The instant invention contemplates any of the known 
methods for introducing polynucleotides into living tissue to induce 

25 expression of proteins. However, this invention provides a novel 
immunogen for introducing proteins into the antigen processing 
pathway to efficiently generate specific CTLs and antibodies. 

Codon Usage and Codon Context 
30 The codon pairings of organisms are highly nonrandom, 

and differ from organism to organism. This information is used to 
construct and express altered or synthetic genes having desired levels of 
translational efficiency, to determine which regions in a genome are 
protein coding regions, to introduce translational pause sites into 
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heterologous genes, and to ascertain relationship or ancestral origin of 
nucleotide sequences 

The expression of foreign heterologous genes in 
transformed organisms is now commonplace. A large number of 
5 mammalian genes, including, for example, murine and human genes, 
have been successfully inserted into single celled organisms. Standard 
techniques in this regard include introduction of the foreign gene to be 
expressed into a vector such as a plasmid or a phage and utilizing that 
vector to insert the gene into an organism. The native promoters for 

10 such genes are commonly replaced with strong promoters compatible 
with the host into which the gene is inserted. Protein sequencing 
machinery permits elucidation of the amino acid sequences of even 
minute quantities of native protein. From these amino acid sequences, 
DNA sequences coding for those proteins can be inferred. DNA 

15 synthesis is also a rapidly developing art, and synthetic genes 
corresponding to those inferred DNA sequences can be readily 
constructed. 

Despite the burgeoning knowledge of expression systems 
and recombinant DNA, significant obstacles remain when one attempts 

20 to express a foreign or synthetic gene in an organism. Many native, 
active proteins, for example, are glycosylated in a manner different 
from that which occurs when they are expressed in a foreign host. For 
this reason, eukaryotic hosts such as yeast may be preferred to bacterial 
hosts for expressing many mammalian genes. The glycosylation 

25 problem is the subject of continuing research. 

Another problem is more poorly understood. Often 
translation of a synthetic gene, even when coupled with a strong 
promoter, proceeds much less efficiently than would be expected. The 
same is frequently true of exogenous genes foreign to the expression 

30 organism. Even when the gene is transcribed in a sufficiently efficient 
manner that recoverable quantities of the translation product are 
produced, the protein is often inactive or otherwise different in 
properties from the native protein. 
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It is recognized that the latter problem is commonly due to 
differences in protein folding in various organisms. The solution to this 
problem has been elusive, and the mechanisms controlling protein 
folding are poorly understood. 
5 The problems related to translational efficiency are 

believed to be related to codon context effects. The protein coding 
regions of genes in all organisms are subject to a wide variety of 
functional constraints, some of which depend on the requirement for 
encoding a properly functioning protein, as well as appropriate 

10 translational start and stop signals. However, several features of protein 
coding regions have been discerned which are not readily understood in 
terms of these constraints. Two important classes of such features are 
those involving codon usage and codon context. 

It is known that codon utilization is highly biased and varies 

15 considerably between different organisms. Codon usage patterns have 
been shown to be related to the relative abundance of tRN A 
isoacceptors. Genes encoding proteins of high versus low abundance 
show differences in their codon preferences. The possibility that biases 
in codon usage alter peptide elongation rates has been widely discussed. 

20 While differences in codon use are associated with differences in 

translation rates, direct effects of codon choice on translation have been 
difficult to demonstrate. Other proposed constraints on codon usage 
patterns include maximizing the fidelity of translation and optimizing 
the kinetic efficiency of protein synthesis. 

25 Apart from the non-random use of codons, considerable 

evidence has accumulated that codon/anticodon recognition is influenced 
by sequences outside the codon itself, a phenomenon termed "codon 
context." There exists a strong influence of nearby nucleotides on the 
efficiency of suppression of nonsense codons as well as missense codons. 

30 Clearly, the abundance of suppressor activity in natural bacterial 
populations, as well as the use of "termination" codons to encode 
selenocysteine and phosphoserine require that termination be context- 
dependent. Similar context effects have been shown to influence the 
fidelity of translation, as well as the efficiency of translation initiation. 
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Statistical analyses of protein coding regions of E. coli have 
demonstrate another manifestation of "codon context.' 1 The presence of 
a particular codon at one position strongly influences the frequency of 
occurrence of certain nucleotides in neighboring codons, and these 
5 context constraints differ markedly for genes expressed at high versus 
low levels. Although the context effect has been recognized, the 
predictive value of the statistical rules relating to preferred nucleotides 
adjacent to codons is relatively low. This has limited the utility of such 
nucleotide preference data for selecting codons to effect desired levels 

10 of translational efficiency. 

The advent of automated nucleotide sequencing equipment 
has made available large quantities of sequence data for a wide variety 
of organisms. Understanding those data presents substantial difficulties. 
For example, it is important to identify the coding regions of the 

15 genome in order to relate the genetic sequence data to protein 

sequences. In addition, the ancestry of the genome of certain organisms 
is of substantial interest. It is known that genomes of some organisms 
are of mixed ancestry. Some sequences that are viral in origin are now 
stably incorporated into the genome of eukaryotic organisms. The viral 

20 sequences themselves may have originated in another substantially 
unrelated species. An understanding of the ancestry of a gene can be 
important in drawing proper analogies between related genes and their 
translation products in other organisms. 

There is a need for a better understanding of codon context 

25 effects on translation, and for a method for determining the appropriate 
codons for any desired translational effect. There is also a need for a 
method for identifying coding regions of the genome from nucleotide 
sequence data. There is also a need for a method for controlling protein 
folding and for insuring that a foreign gene will fold appropriately 

30 when expressed in a host. Genes altered or constructed in accordance 
with desired translational efficiencies would be of significant worth. 

Another aspect of the practice of recombinant DNA 
techniques for the expression by microorganisms of proteins of 
industrial and pharmaceutical interest is the phenomenon of "codon 
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preference M . While it was earlier noted that the existing machinery for 
gene expression is genetically transformed host cells will "operate 11 to 
construct a given desired product, levels of expression attained in a 
microorganism can be subject to wide variation, depending in part on 
5 specific alternative forms of the amino acid-specifying genetic code 
present in an inserted exogenous gene. A "triplet" codon of four 
possible nucleotide bases can exist in 64 variant forms. That these 
forms provide the message for only 20 different amino acids (as well as 
transcription initiation and termination) means that some amino acids 

10 can be coded for by more than one codon. Indeed, some amino acids 
have as many as six "redundant", alternative codons while some others 
have a single, required codon. For reasons not completely understood, 
alternative codons are not at all uniformly present in the endogenous 
DNA of differing types of cells and there appears to exist a variable 

15 natural hierarchy or "preference" for certain codons in certain types of 
cells. 

As one example, the amino acid leucine is specified by any 
of six DNA codons including CTA, CTC, CTG, CTT, TTA, and TTG 
(which correspond, respectively, to the mRNA codons, CUA, CUC, 

20 CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon 
frequencies for microorganisms has revealed endogenous DNA of IL 
coli most commonly contains the CTG leucine-specifying codon, while 
the DNA of yeasts and slime molds most commonly includes a TTA 
leucine-specifying codon. In view of this hierarchy, it is generally held 

25 that the likelihood of obtaining high levels of expression of a leucine- 
rich polypeptide by an E. coli host will depend to some extent on the 
frequency of codon use. For example, a gene rich in TTA codons will 
in all probability be poorly expressed in E. coli . whereas a CTG rich 
gene will probably highly express the polypeptide. Similarly, when 

30 yeast cells are the projected transformation host cells for expression of a 
leucine-rich polypeptide, a preferred codon for use in an inserted DNA 
would be TTA. 

The implications of codon preference phenomena on 
recombinant DNA techniques are manifest, and the phenomenon may 
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serve to explain many prior failures to achieve high expression levels of 
exogenous genes in successfully transformed host organisms-a less 
"preferred" codon may be repeatedly present in the inserted gene and 
the host cell machinery for expression may not operate as efficiently. 
5 This phenomenon suggests that synthetic genes which have been 

designed to include a projected host cell's preferred codons provide a 
preferred form of foreign genetic material for practice of recombinant 
DNA techniques. 

10 Protein Trafficking 

The diversity of function that typifies eukaryotic cells 
depends upon the structural differentiation of their membrane 
boundaries. To generate and maintain these structures, proteins must be 
transported from their site of synthesis in the endoplasmic reticulum to 

15 predetermined destinations throughout the cell. This requires that the 
trafficking proteins display sorting signals that are recognized by the 
molecular machinery responsible for route selection located at the 
access points to the main trafficking pathways. Sorting decisions for 
most proteins need to be made only once as they traverse their 

20 biosynthetic pathways since their final destination, the cellular location 
at which they perform their function, becomes their permanent 
residence. 

Maintenance of intracellular integrity depends in part on 
the selective sorting and accurate transport of proteins to their correct 
25 destinations. Over the past few years the dissection of the molecular 
machinery for targeting and localization of proteins has been studied 
vigorously. Defined sequence motifs have been identified on proteins 
which can act as 'address labels'. A number of sorting signals have been 
found associated with the cytoplasmic domains of membrane proteins. 

30 

SUMMARY OF THE INVENTION 

This invention relates to novel formulations of nucleic acid 
pharmaceutical products, specifically nucleic acid vaccine products. 
The nucleic acid products, when introduced directly into muscle cells, 



WO 97/47358 



PCT/US97/09884 



- 11 - 

induce the production of immune responses which specifically recognize 
Hepatitis C virus (HCV). 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 shows the nucleotide sequence of the VIRa vector. 

Figure 2 is a diagram of the VIRa vector. 
Figure 3 is a diagram of the Vtpa vector. 
Figure 4 is the VUb vector 

Figure 5 shows an optimized sequence of the HCV core 

10 antigen. 

Figure 6 shows VIRa.HCVlCorePAb, Vtpa.HCVlCorePAb 
and VUb.HCVlCorePAb. 

Figure 7 shows the Hepatitis C Virus Core Antigen 

Sequence. 

15 Figure 8 shows codon utilization in human protein-coding 

sequences (from Lathe et al.). 

Figure 9 shows an optimized sequence of the HCV El 

protein. 

Figure 10 shows an optimized sequence of the HCV E2 

20 protein. 

Figure 1 1 shows an optimized sequence of the HCV El +E2 

proteins. 

Figure 12 shows an optimized sequence of the HCV NS5a 

protein. 

25 Figure 1 3 shows an optimized sequence of the HCV NS5b 

protein. 

DETAILED DESCRIPTION OF THE INVENTION 

This invention relates to novel formulations of nucleic acid 
30 pharmaceutical products, specifically nucleic acid vaccine products. 

The nucleic acid vaccine products, when introduced directly into muscle 
cells, induce the production of immune responses which specifically 
recognize Hepatitis C virus (HCV). 
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Non-A, Non-B hepatitis (NANBH) is a transmissible disease 
(or family of diseases) that is believed to be virally induced, and is 
distinguishable from other forms of virus-associated liver disease, such 
as those caused by hepatitis A virus (HAV), hepatitis B virus (HBV), 
5 delta hepatitis virus (HDV), cytomegalovirus (CMV) or Epstein-Barr 
virus (EBV). Epidemiologic evidence suggests that there may be three 
types of NANBH: the water-borne epidemic type; the blood or needle 
associated type; and the sporadically occurring (community acquired) 
type. However, the number of causative agents is unknown. Recently, a 

10 new viral species, hepatitis C virus (HCV) has been identified as the 
primary (if not only) cause of blood-associated NANBH (BB-NANBH). 
Hepatitis C appears to be the major form of transfusion-associated 
hepatitis in a number of countries, including the United States and 
Japan. There is also evidence implicating HCV in induction of 

15 hepatocellular carcinoma. Thus, a need exists for an effective method 
for preventing or treating HCV infection: currently, there is none. 

The HCV may be distantly related to the flaviviridae. The 
Flavivirus family contains a large number of viruses which are small, 
enveloped pathogens of man. The morphology and composition of 

20 Flavivirus particles are known, and are discussed in M. A. Brinton, in 
"The Viruses: The Togaviridae And Flaviviridae" (Series eds. Fraenkel- 
Conrat and Wagner, vol. eds. Schlesinger and Schlesinger, Plenum 
Press, 1986), pp. 327-374. Generally, with respect to morphology, 
Flaviviruses contain a central nucleocapsid surrounded by a lipid 

25 bilayer. Virions are spherical and have a diameter of about 40-50 nm. 
Their cores are about 25-30 nm in diameter. Along the outer surface of 
the virion envelope are projections measuring about 5-10 nm in length 
with terminal knobs about 2 nm in diameter. Typical examples of the 
family include Yellow Fever virus, West Nile virus, and Dengue Fever 

30 virus. They possess positive-stranded RNA genomes (about 1 1 ,000 
nucleotides) that are slightly larger than that of HCV and encode a 
polyprotein precursor of about 3500 amino acids. Individual viral 
proteins are cleaved from this precursor polypeptide. 
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The genome of HCV appears to be single-stranded RNA 
containing about 10,000 nucleotides. The genome is positive-stranded, 
and possesses a continuous translational open reading frame (ORF) that 
encodes a polyprotein of about 3,000 amino acids. In the ORF, the 
5 structural proteins appear to be encoded in approximately the first 
quarter of the N-terminal region, with the majority of the polyprotein 
attributed to non-structural proteins. When compared with all known 
viral sequences, small but significant co-linear homologies are observed 
with the nonstructural proteins of the Flavivirus family, and with the 
10 pestiviruses (which are now also considered to be part of the Flavivirus 
family). 

Intramuscular inoculation of polynucleotide constructs, i.e., 
DNA plasmids encoding proteins have been shown to result in the 
generation of the encoded protein in situ in muscle cells. By using 

15 cDNA plasmids encoding viral proteins, both antibody and CTL 
responses were generated, providing homologous and heterologous 
protection against subsequent challenge with either the homologous or 
cross-strain protection, respectively. Each of these types of immune 
responses offers a potential advantage over existing vaccination 

20 strategies. The use of PNVs (polynucleotide vaccines) to generate 

antibodies may result in an increased duration of the antibody responses 
as well as the provision of an antigen that can have both the exact 
sequence of the clinically circulating strain of virus as well as the 
proper post-translational modifications and conformation of the native 

25 protein (vs. a recombinant protein). The generation of CTL responses 
by this means offers the benefits of cross-strain protection without the 
use of a live potentially pathogenic vector or attenuated virus. 

The standard techniques of molecular biology for 
preparing and purifying DNA constructs enable the preparation of the 

30 DNA therapeutics of this invention. While standard techniques of 
molecular biology are therefore sufficient for the production of the 
products of this invention, the specific constructs disclosed herein 
provide novel therapeutics which surprisingly produce cross-strain 
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protection, a result heretofore unattainable with standard inactivated 
whole virus or subunit protein vaccines. 

The amount of expressible DNA to be introduced to a 
vaccine recipient will depend on the strength of the transcriptional and 
5 translational promoters used in the DNA construct, and on the 
immunogenicity of the expressed gene product. In general, an 
immunologically or prophylactically effective dose of about 1 fig to 1 
mg, and preferably about 10 ^ig to 300 \ig is administered directly into 
muscle tissue. Subcutaneous injection, intradermal introduction, 
10 impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also 
contemplated. It is also contemplated that booster vaccinations are to be 
provided. 

The DNA may be naked, that is, unassociated with any 

15 proteins, adjuvants or other agents which impact on the recipients 
immune system. In this case, it is desirable for the DNA to be in a 
physiologically acceptable solution, such as, but not limited to, sterile 
saline or sterile buffered saline. Alternatively, the DNA may be 
associated with surfactants, liposomes, such as lecithin liposomes or 

20 other liposomes known in the art, as a DNA-liposome mixture, (see for 
example WO93/24640) or the DNA may be associated with an adjuvant 
known in the art to boost immune responses, such as a protein or other 
carrier. Agents which assist in the cellular uptake of DNA, such as, but 
not limited to, calcium ions, detergents, viral proteins and other 

25 transfection facilitating agents may also be used to advantage. These 
agents are generally referred to as transfection facilitating agents and as 
pharmaceutically acceptable carriers. As used herein, the term gene 
refers to a segment of nucleic acid which encodes a discrete polypeptide. 
The term pharmaceutical, and vaccine are used interchangeably to 

30 indicate compositions useful for inducing immune responses. The terms 
construct, and plasmid are used interchangeably. The term vector is 
used to indicate a DNA into which genes may be cloned for use 
according to the method of this invention. 
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The following examples are provided to further define the 
invention, without limiting the invention to the specifics of the 
examples. 



5 EXAMPLE 1 

V1J EXPRESSION VECTORS: 

VI J is derived from vectors VI and pUC18, a 
commercially available plasmid. VI was digested with Sspl and EcoRI 
restriction enzymes producing two fragments of DNA. The smaller of 

10 these fragments, containing the CMVintA promoter and Bovine Growth 
Hormone (BGH) transcription termination elements which control the 
expression of heterologous genes, was purified from an agarose 
electrophoresis gel. The ends of this DNA fragment were then 
"blunted" using the T4 DNA polymerase enzyme in order to facilitate 

15 its ligation to another "blunt-ended" DNA fragment. 

pUC18 was chosen to provide the "backbone" of the 
expression vector. It is known to produce high yields of plasmid, is 
well-characterized by sequence and function, and is of minimum size. 
We removed the entire lac operon from this vector, which was 

20 unnecessary for our purposes and may be detrimental to plasmid yields 
and heterologous gene expression, by partial digestion with the Haell 
restriction enzyme. The remaining plasmid was purified from an 
agarose electrophoresis gel, blunt-ended with the T4 DNA polymerase , 
treated with calf intestinal alkaline phosphatase, and ligated to the 

25 CMVintA/BGH element described above. Plasmids exhibiting either of 
two possible orientations of the promoter elements within the pUC 
backbone were obtained. One of these plasmids gave much higher 
yields of DNA in E. coli and was designated VI J. This vector's 
structure was verified by sequence analysis of the junction regions and 

30 was subsequently demonstrated to give comparable or higher expression 
of heterologous genes compared with VI. The ampicillin resistance 
marker was replaced with the neomycin resistance marker to yield 
vector VUneo. 
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An Sfi I site was added to VUneo to facilitate integration 
studies. A commercially available 13 base pair Sfi I linker (New 
England BioLabs) was added at the Kpn I site within the BGH sequence 
of the vector. VI Jneo was linearized with Kpn I, gel purified, blunted 
5 by T4 DNA polymerase, and ligated to the blunt Sfi I linker. Clonal 
isolates were chosen by restriction mapping and verified by sequencing 
through the linker. The new vector was designated VUns. Expression 
of heterologous genes in VUns (with Sfi I) was comparable to 
expression of the same genes in VI Jneo (with Kpn I). 

10 Vector VIRa (Sequence is shown in Figure I; map is shown 

in Figure 2) was derived from vector V1R, a derivative of the VUns 
vector. Multiple cloning sites (BglU 9 Kpnl, EcoRV, EcoRl, Sail, and 
Notl) were introduced into V1R to create the VIRa vector to improve 
the convenience of subcloning. VIRa vector derivatives containing the 

15 tpa leader sequence and ubiquitin sequence were generated (Vtpa 
(Figure 3) and Vub (Figure 4), respectively). Expression of viral 
antigen from Vtpa vector will target the antigen protein into the 
exocytic pathway, thus producing a secretable form of the antigen 
proteins. These secreted proteins are likely to be captured by 

20 professional antigen presenting cells, such as macrophages and dendritic 
cells, and processed and presented by class II molecules to activate CD4+ 
Th cells. They also are more likely to efficiently simulate antibody 
responses. Expression of viral antigen through VUb vector will 
produce a ubiquitin and antigen fusion protein. The uncleavable 

25 ubiquitin segment (glycine to alanine change at the cleavage site, Butt et 
al., JBC 263:16364, 1988) will target the viral antigen to ubiquitin- 
associated proteasomes for rapid degradation. The resulting peptide 
fragments will be transported into the ER for antigen presentation by 
class I molecules. This modification is attempted to enhance the class I 

30 molecule-restricted CTL responses against the viral antigen (Townsend 
et al, JEM 168:1211, 1988). 
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EXAMPLE 2 

DESIGN AND CONSTRUCTION OF THE SYNTHETIC GENES 

A. Design of Synthetic Gene Segments for HCV Gene Expression : 
5 Gene segments were converted to sequences having 

identical translated sequences (except where noted) but with alternative 
codon usage as defined by R. Lathe in a research article from 7. Molec. 
Biol. Vol. 183, pp. 1-12 (1985) entitled "Synthetic Oligonucleotide 
Probes Deduced from Amino Acid Sequence Data: Theoretical and 

10 Practical Considerations". The methodology described below was based 
on our hypothesis that the known inability to express a gene efficiently 
in mammalian cells is a consequence of the overall transcript 
composition. Thus, using alternative codons encoding the same protein 
sequence may remove the constraints on HCV gene expression. 

15 Inspection of the codon usage within HCV genome revealed that a high 
percentage of codons were among those infrequently used by highly 
expressed human genes. The specific codon replacement method 
employed may be described as follows employing data from Lathe et 
ah: 

20 1 . Identify placement of codons for proper open 

reading frame. 

2. Compare wild type codon for observed frequency of 
use by human genes (refer to Table 3 in Lathe et ah). 

3. If codon is not the most commonly employed, 

25 replace it with an optimal codon for high expression based on data in 
Table 5. 

4. Inspect the third nucleotide of the new codon and the 
first nucleotide of the adjacent codon immediately 3'- of the first. If a 
5'-CG-3' pairing has been created by the new codon selection, replace it 

30 with the choice indicated in Table 5. 

5. Repeat this procedure until the entire gene segment 
has been replaced. 

6. Inspect new gene sequence for undesired sequences 
generated by these codon replacements (e.g., "ATTTA" sequences, 
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inadvertent creation of intron splice recognition sites, unwanted 
restriction enzyme sites, etc.) and substitute codons that eliminate these 
sequences. 

7. Assemble synthetic gene segments and test for 
5 improved expression. 

B. HCV CORE ANTIGEN SEQUENCE 

The consensus core sequence of HCV was adopted from a 
generalized core sequence reported by Bukh et al. (PNAS, 91:8239, 
10 1994). This core sequence contains all the identified CTL epitopes in 
both human and mouse. The gene is composed of 573 nucleotides and 
encodes 191 amino acids. The predicted molecular weight is about 23 
kDa. 

The codon replacement was conducted to eliminate codons 
1 5 which may hinder the expression of the HCV core protein in transfected 
mammalian cells in order to maximize the translational efficiency of 
DNA vaccine. Twenty three point two percent (23.2%) of nucleotide 
sequence (133 out of 573 nucleotides) were altered, resulting in changes 
of 61.3% of the codons (1 17 out 191 codons) in the core antigen 
20 sequence. The optimized nucleotide sequence of HCV core is shown in 
Figure 5. 

C CONSTRUCTION OF THE SYNTHETIC CORE GENE 

The optimized HCV core gene (Figure 5) was constructed 

25 as a synthetic gene annealed from multiple synthetic oligonucleotides. 
To facilitate the identification and evaluation of the synthetic gene 
expression in cell culture and its immunogenicity in mice, a CTL 
epitope derived from influenza virus nucleoprotein residues 366-374 
and an antibody epitope sequence derived from SV40 T antigen residues 

30 684-698 were tagged to the carboxyl terminal of the core sequence 
(Figure 6). For clinical use it may be desired to express the core 
sequence without the nucleoprotein 366-374 and SV40 T 684-698 
sequences. For this reason, the sequence of the two epitopes is flanked 
by two EcoRl sites which will be used to excise this fragment of 



WO 97/47358 



PCT/US97/09884 



19 



sequence at a later time. Thus an embodiment of the invention for 
clinical use could consist of the VIRa.HCVlCorePAb, 
Vtpa.HCVlCorePAb, or VUb.HCV ICorePAb plasmids that had been 
cut with EcoRI, annealed, and ligated to yield plasmids 
5 V 1 Ra.HCV 1 Core, Vtpa.HCV 1 Core, and VUb.HCV 1 Core. 

The synthetic gene was built as three separate segments in 
three vectors, nucleotides 1 to 80 in VIRa, nucleotides 80 to 347 (BstXl 
site) in pUC18, and nucleotides 347 to 573 plus the two epitope 
sequence in pUCl 8. All the segments were verified by DNA 
10 sequencing, and joined together in V IRa vector. 

D. HCV Gene Expression Constructs: 

In each case, the junction sequences from the 5' promoter 
region (CMVintA) into the cloned gene is shown. The position at which 
15 the junction occurs is demarcated by a 7", which does not represent any 
discontinuity in the sequence. 

The nomenclature for these constructs follows the 
convention: "Vector name-HCV strain-gene". 

20 

VlRa.HCVl.CorePAb 
— IntA— AGA TCT ACC / ATG AGC-HCV.Core.-GCC / GAA TTC GCT TCC- 
PAb Sequence-TAA / ACC CGG GAA TTC TA A A / GTC G AC-BGH— 

25 Vtpa.HCVl.CorePAb 

— IntA-ATC ACC / ATG GAT-tpa leader-GAG ATC-TTC / ATG AGC- 
HCV.Core.-GCC / GAA TTC GCT TCC-PAb Sequence-TAA / ACC CGG GAA 
TTC TAA A / GTC GAC-BGH— 

30 VUb.HCV l.CorePAb. 

—IntA— AGA TCC ACC / ATG CAG-Ubiquitin~GGT GCA GAT CTG/ ATG AGC- 
HCV.Core.~GCC / GAA TTC GCT TCC-PAb Sequence-TAA / ACC CGG GAA 
TTC TAA A / GTC G AC-BGH— 



WO 97/47358 



PCT/US97/09884 



-20- 



VlRa.HCVl.Core 

— IntA-AGA TCT ACC / ATG AGC~HCV.Core.-GCC / TAA A / GTC GAC- 
BGH — 



5 Vtpa.HCVl.Core 

— IntA-ATC ACC / ATG GAT--tpa leader--GAG ATC-TTC / ATG AGC-- 
HCV.Core.-GCC / TAA A / GTC GAC-BGH — 

VUb.HCV 1 .Core 

10 — LntA-AGA TCC ACC / ATG CAG-Ubiquitin-GGT GCA GAT CTG/ ATG AGC-- 
HCV.Core.--GCC / TAA A / GTC GAC-BGH— 



E. OTHER SYNTHETIC HCV GENES 

Using similar codon optimization techniques, synthetic 
15 genes encoding the HCV El (Figure 9), HCV E2 (Figure 10), HCV 

E1+E2 (Figure 11), HCV NS5a (Figure 12) and HCV NS5b (Figure 13) 
proteins were created. 
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WHAT IS CLAIMED: 



1 . A synthetic polynucleotide comprising a DNA 
sequence encoding an HCV protein selected from the group consisting 
5 of HCV core protein, HCV El protein, HCV E1+E2 protein, HCV NS5a 
protein, HCV NS5b protein and fragments thereof, the DNA sequence 
comprising codons optimized for expression in a vertebrate host. 



2. A plasmid vector comprising the polynucleotide of 
10 Claim 1, the plasmid vector being suitable for immunization of a 
vertebrate host. 



3. The polynucleotide of Claim 1 which is HCV 
genotype I/Ia core. 

15 

4. The polynucleotide of Claim 1 having the sequence 

ATGAGCACcA AcCCcAAgCC cCAgAGgAAg ACCAAgaGgA ACACCAACaG gaGgCCcCAG 
GAtGTgAAGT TCCCtGGgGG aGGcCAGATt GTgGGaGGgG TcTACcTGcT GCCcaGgAGG 
GGCCCCAGGc TGGGgGTGaG gGCtACcaGG AAGACcTCtG AGaGGTCcCA gCCcaGgGGc 

20 AGGaGgCAGC CcATCCCCAA GGCcaGgaGG CCtGAGGGCc GcTCCTGGGC cCAGCCtGGc 
TACCCcTGGC CCCTgTATGG CAATGAaGGC TTtGGcTGGG CtGGcTGGCT gCTGTCCCCC 
aGgGGCTCca GGCCctccTG GGGCCCCACa GACCCCaGGa GgaGGTCcaG gAAccTGGGc 
AAGGTgATtG AcACCCTgAC cTGtGGCTTt GCtGACCTgA TGGGcTACAT CCCcCTgGTg 
GGgGCtCCtG TgGGaGGgGT gGCtAGGGCt CTGGCtCATG GgGTgAGGGT gCTGGAGGAt 

25 GGGGTGAACT ATGCtACtGG cAAccTGCCt GGcTGCTCcT TCTCcATCTT CCTgCTGGCc 
CTGCTcTCCT GCCTGACaGT gCCtGCTTCT GCc 



5. The plasmid vector of Claim 2 having the sequence 

30 GATATTGGCT ATTGGCCATT GCATACGTTG TATCCATATC ATAATATGTA CATTTATATT 
GGCTCATGTC CAACATTACC GCCATGTTGA CATTGATTAT TGACTAGTTA TTAATAGTAA 
TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC ATAACTTACG 
GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC AATAATGACG 
TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT GGAGTATTTA 

35 CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT 
GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC CTTATGGGAC 
TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT GATGCGGTTT 
TGGCAGTACA TCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC AAGTCTCCAC 
CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT TCCAAAATGT 

40 CGTAACAACT CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT 
ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATCC ACGCTGTTTT 
GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG GCCGGGAACG GTGCATTGGA 
ACGCGGATTC CCCGTGCCAA GAGTGACGTA AGTACCGCCT ATAGAGTCTA TAGGCCCACC 
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CCCTTGGCTT CTTATGCATG CTATACTGTT 
CTCATGTTAT AGGTGATGGT ATAGCTTAGC 
CCACTCCCCT ATTGGTGACG ATACTTTCCA 
AACTCTCTTT ATTGGCTATA TGCCAATACA 
5 ATTTTTACAG GATGGGGTCT CATTTATTAT 
CCCAGTGCCC GCAGTTTTTA TTAAACATAA 
TGTTCCGGAC ATGGGCTCTT CTCCGGTAGC 
CATGCCTCCA GCGACTCATG GTCGCTCGGC 
CTTAGGCACA GCACGATGCC CACCACCACC 

10 TATGTGTCTG AAAATGAGCT CGGGGAGCGG 
AAGGCAGCGG CAGAAGAAGA TGCAGGCAGC 
GTAACTCCCG TTGCGGTGCT GTTAACGGTG 
GCTGCCGCGC GCGCCACCAG ACATAATAGC 
GGTCTTTTCT GCAGTCACCG TCCTTAgatc 

15 GAAGACCAAG AGGAACACCA ACAGGAGGCC 
GATTGTGGGA GGGGTCTACC TGCTGCCCAG 
CAGGAAGACC TCTGAGAGGT CCCAGCCCAG 
GAGGCCTGAG GGCCGCTCCT GGGCCCAGCC 
AGGCTTTGGC TGGGCTGGCT GGCTGCTGTC 

20 CACAGACCCC AGGAGGAGGT CCAGGAACCT 
CTTTGCTGAC CTGATGGGCT ACATCCCCCT 
GGCTCTGGCT CATGGGGTGA GGGTGCTGGA 
GCCTGGCTGC TCCTTCTCCA TCTTCCTGCT 
TTCTGCCgaa ttcgcttcca atgagaacat 

25 ccgcggcttc acctgcttca agaagtaaac 
ATCTGCTGTG CCTTCTAGTT GCCAGCCATC 
GACCCTGGAA GGTGCCACTC CCACTGTCCT 
TTGTCTGAGT AGGTGTCATT CTATTCTGGG 
GGATTGGGAA GACAATAGCA GGCATGCTGG 

30 AGCGGCCTTA ATTAAGGCCG CAGCGGCCGT 
TCGACCCGTA AAAAGGCCGC GTTGCTGGCG 
CATCACAAAA ATCGACGCTC AAGTCAGAGG 
CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG 
GGATACCTGT CCGCCTTTCT CCCTTCGGGA 

35 AGGTATCTCA GTTCGGTGTA GGTCGTTCGC 
GTTCAGCCCG ACCGCTGCGC CTTATCCGGT 
CACGACTTAT CGCCACTGGC AGCAGCCACT 
GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG 
TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 

40 TCCGGCAAAC AAACCACCGC TGGTAGCGGT 
CGCAGAAAAA AAGGATCTCA AGAAGATCCT 
CTGCCAGTGT TACAACCAAT TAACCAATTC 
AAACTGCAAT TTATTCATAT CAGGATTATC 
TAATGAAGGA GAAAACTCAC CGAGGCAGTT 

45 TGCGATTCCG ACTCGTCCAA CATCAATACA 
GTTATCAAGT GAGAAATCAC CATGAGTGAC 
ATGCATTTCT TTCCAGACTT GTTCAACAGG 
CGCATCAACC AAACCGTTAT TCATTCGTGA 
GCTGTTAAAA GGACAATTAC AAACAGGAAT 

50 CGCATCAACA ATATTTTCAC CTGAATCAGG 
CCCGGGGATC GCAGTGGTGA GTAACCATGC 
GGTCGGAAGA GGCATAAATT CCGTCAGCCA 
ATTGGCAACG CTACCTTTGC CATGTTTCAG 
CAATCGATAG ATTGTCGCAC CTGATTGCCC 

55 TAAATCAGCA -TCCATGTTGG AATTTAATCG 



TTTGGCTTGG GGTCTATACA CCCCCGCTTC 
CTATAGGTGT GGGTTATTGA CCATTATTGA 
TTACTAATCC ATAACATGGC TCTTTGCCAC 
CTGTCCTTCA GAGACTGACA CGG AC TCTGT 
TTACAAATTC ACATATACAA CACCACCGTC 
CGTGGGATCT CCACGCGAAT CTCGGGTACG 
GGCGGAGCTT CTACATCCGA GCCCTGCTCC 
AGCTCCTTGC TCCTAACAGT GGAGGCCAGA 
AGTGTGCCGC ACAAGGCCGT GGCGGTAGGG 
GCTTGCACCG CTGACGCATT TGGAAGACTT 
TGAGTTGTTG TGTTCTGATA AGAGTCAGAG 
GAGGGCAGTG TAGTCTGAGC AGTACTCGTT 
TGACAGACTA ACAGACTGTT CCTTTCCATG 
taccATGAGC ACCAACCCCA AGCCCCAGAG 
CCAGGATGTG AAGTTCCCTG GGGGAGGCCA 
GAGGGGCCCC AGGCTGGGGG TGAGGGCTAC 
GGGCAGGAGG CAGCCCATCC CCAAGGCCAG 
TGGCTACCCC TGGCCCCTGT ATGGCAATGA 
CCCCAGGGGC TCCAGGCCCT CCTGGGGCCC 
GGGCAAGGTG ATTGACACCC TGACCTGTGG 
GGTGGGGGCT CCTGTGGGAG GGGTGGCTAG 
GGATGGGGTG AACTATGCTA CTGGCAACCT 
GGCCCTGCTC TCCTGCCTGA CAGTGCCTGC 
ggagaccatg aaccagccct accacatctg 
ccgggaattc taaagtcgaC AGCGGCCGCG 
TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT 
TTCCTAATAA AATGAGGAAA TTGCATCGCA 
GGGTGGGGTG GGGCAGCACA GCAAGGGGGA 
GGATGCGGTG GGCTCTATGG GTACGGCCGC 
ACCCAGGTGC TGAAGAATTG ACCCGGTTCC 
TTTTTCCATA GGCTCCGCCC CCCTGACGAG 
TGGCGAAACC CGACAGGACT ATAAAGATAC 
CGCTCTCCTG TTCCGACCCT GCCGCTTACC 
AGCGTGGCGC TTTCTCAATG CTCACGCTGT 
TCCAAGCTGG GCTGTGTGCA CGAACCCCCC 
AACTATCGTC TTGAGTCCAA CCCGGTAAGA 
GGTAACAGGA TTAGCAGAGC GAGGTATGTA 
CCTAACTACG GCTACACTAG AAGGACAGTA 
ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA 
GGTTTTTTTG TTTGCAAGCA GCAGATTACG 
TTGATCTTTT CTACGTGATC CCGTAATGCT 
TGATTAGAAA AACTCATCGA GCATCAAATG 
AATACCATAT TTTTGAAAAA GCCGTTTCTG 
CCATAGGATG GCAAGATCCT GGTATCGGTC 
ACCTATTAAT TTCCCCTCGT CAAAAATAAG 
GACTGAATCC GGTGAGAATG GCAAAAGCTT 
CCAGCCATTA CGCTCGTCAT CAAAATCACT 
TTGCGCCTGA GCGAGACGAA ATACGCGATC 
CGAATGCAAC CGGCGCAGGA ACACTGCCAG 
ATATTCTTCT AATACCTGGA ATGCTGTTTT 
ATCATCAGGA GTACGGATAA AATGCTTGAT 
GTTTAGTCTG ACCATCTCAT CTGTAACATC 
AAACAACTCT GGCGCATCGG GCTTCCCATA 
GACATTATCG CGAGCCCATT TATACCCATA 
CGGCCTCGAG CAAGACGTTT CCCGTTGAAT 
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ATGGCTCATA ACACCCCTTG TATTACTGTT TATGTAAGCA GACAGTTTTA TTGTTCATGA 
TGATATATTT TTATCTTGTG CAATGTAACA TCAGAGATTT TGAGACACAA CGTGGCTTTC 
C 

5 6. The polynucleotide of Claim 4 from which the PAb 

sequence has been removed. 

7. The plasmid vector of Claim 5 from which the PAb 
sequence has been removed. 

10 

8. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between 1 
ng and 100 mg of the polynucleotide of Claim 1 into the tissue of the 
vertebrate. 

15 

9. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 
the tissue of a vertebrate the polynucleotide of Claim 1. 

20 10. A vaccine for inducing immune responses against 

HCV infection which comprises the polynucleotide of Claim 1 and a 
pharmaceutical^ acceptable carrier. 

11. A method for inducing anti-HCV immune responses 
25 in a primate which comprises introducing the polynucleotide of Claim 1 

into the tissue of said primate and concurrently administering 
interleukin-12 parenterally. 

12. A method of inducing an antigen presenting cell to 
30 stimulate cytotoxic and helper T-cell proliferation an effector functions 

including lymphokine secretion specific to HCV antigens which 
comprises exposing cells of a vertebrate in vivo to the polynucleotide of 
Claim 1. 
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13. A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 
Claim 1 in combination with interferon-alpha, Ribavirin, Zidovudine, 
or other pharmaceutically acceptable antiviral agents,. 

5 

14. A pharmaceutical composition comprising the 
polynucleotide of Claim 1. 



15. A method of inducing an immune response 
10 comprising administering the polynucleotide of Claim 1 to a patient, the 
administration of the polynucleotide antedating or coinciding or 
following administration to the patient of a subunit, recombinant, 
recombinant live vector, inactivated, recombinant inactivated vector, or 
live attenuated HCV vaccine. 



15 



20 



16. A method for inducing immune responses in a 
vertebrate against HCV epitopes which comprises introducing between 1 
ng and 100 mg of the polynucleotide of Claim 2 into the tissue of the 
vertebrate. 

17. A method for inducing immune responses against 
infection or disease caused by HCV which comprises introducing into 
the tissue of a vertebrate the polynucleotide of Claim 2. 



25 18. A vaccine for inducing immune responses against 

HCV infection which comprises the polynucleotide of Claim 2 and a 
pharmaceutically acceptable carrier. 



19. A method for inducing anti-HCV immune responses 
30 in a primate which comprises introducing the polynucleotide of Claim 2 
into the tissue of said primate and concurrently administering 
interleukin 12 parenterally. 



SUBSTITUTE SHEET (RULE 26) 



WO 97/47358 PCTYUS97/09884 



25 



20. A method of inducing an antigen presenting cell to 
stimulate cytotoxic and helper T-cell proliferation an effector functions 
including lymphokine secretion specific to HCV antigens which 
comprises exposing cells of a vertebrate jn vivo to the polynucleotide of 

5 Claim 2. 

21 . A method of treating a patient in need of such 
treatment comprising administering to the patient the polynucleotide of 
Claim 2 in combination with interferon-alpha, Ribavirin, Zidovudine, 

10 or other pharmaceutical^ acceptable antiviral agents.. 

22. A pharmaceutical composition comprising the 
polynucleotide of Claim 2. 

15 23. A method of inducing an immune response 

comprising administering the polynucleotide of Claim 2 to a patient, the 
administration of the polynucleotide antedating or coinciding or 
following administration to the patient of a subunit, recombinant, 
recombinant live vector, inactivated, recombinant inactivated vector, or 

20 live attenuated HCV vaccine. 

24. The vector of Claim 2 which is selected from 
VlRa.HCVlCorePAb, Vtpa.HCVlCorePAb, VUb.HCVlCorePAb, 
VlRa.HCVlCore, Vtpa.HCVlCore and VUb.HCVlCore. 



25 



25. A pharmaceutical composition comprising the vector 

of Claim 21. 



26. The DNA sequence of Claim 1 selected from the 
30 group consisting of a nucleotide sequence shown in Figure 5, Figure 9, 
Figure 10, Figure 11, Figure 12 and Figure 13. 
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TABLE 3 

CODON UTILIZATION IN HUMAN PROTEIN-CODING SEQUENCES 



0 


b 


c 


d 


e 


( 


0 


b 


c 


d 


e 


f 


F 


UUU 


68 


0.35 


193 


4.5 


Y 


UAU 


72 


0.47 


153 


3.6 




UUC 


125 


0.65 








UAC 


81 


0.53 






1 


UUA 


20 


0.05 


445 


10.4 


H 


CAU 


44 


0.42 


105 


2.5 




UUG 


42 


0.09 








CAC 


61 


0.58 








CUU 


50 


0.11 




















cue 


99 


0.22 






Q 


CAA 


50 


0.26 


192 


4.5 




CUA 


30 


0.07 








CAG 


142 


0.74 








CUG 


204 


046 

w. i w 






























N 


AAU 


51 


0.34 


148 


3.5 


1 


AUU 


28 


0.23 


123 


2.9 




AAC 


97 


0.66 








Aiir 


79 


064 




















AUA 

r\\Jr\ 


16 

1 w 


0 13 

Wa 1 %J 






K 


AAA 


137 


0.45 


303 


7.0 
















AAG 


166 


0.55 






M 

m 


AUG 

nuu 


77 


1 00 

1 .WW 


77 


1.8 


























o 


GAU 


79 


0.38 


209 


4.9 


v 


nun 


"to 

jj 


0 13 


266 


62 




GAC 

w»»w 


130 

1 WW 


0 62 








nnr 

uuw 


1 L 


0 27 




















nil a 

butt 


25 


0 0Q 






F 

La 


GAA 


125 


0 40 

W. I w 


311 

W 1 1 


73 




nun 


134 


0 50 








GAG 

w*»w 


186 

1 WW 


0 60 

w.ww 






s 


UCU 


59 


0.17 


349 


8.1 


C 


UGU 


44 


0.30 


147 


3.4 




UCC 


91 


0.26 








UGC 


103 


0.70 








UfA 


37 


011 




















UCG 


25 


0 07 






w 


UGG 


56 


1 00 


56 

wv 


1 3 




AGU 


37 


011 




















AGC 


100 


0.29 






R 


CGV 


19 


0.09 


215 


5.0 
















CGC 


40 


0.19 






p 

r 


ecu 

www 




0 24 


212 


49 




CGA 


22 


0.10 








ccc 


86 


0.41 








CGG 


33 


0.15 








CCA 


51 

w ■ 


0.24 








AGA 


51 


0.24 








CCG 


24 


0.11 








AGG 


50 


0.23 






T 


ACU 


47 


0.20 


238 


5.6 


G 


GGU 


36 


0.15 


245 


5.7 




ACC 


113 


0.47 








GGC 


108 


0.44 








ACA 


50 


0.21 








GGA 


42 


0.17 








ACG 


28 


0.12 








GGG 


59 


0.24 






A 


GCU 


91 


0.31 


298 


7.0 




TOTAL 4285 RESIDUES EXCLUDING 






GCC 


119 


0.40 








N-TERMINAL METHIONINE RESIDUES 




GCA 


51 


0.17 




















GCG 


37 


0.12 



















FIG.8 



WO 97/47358 



PCT/US97/09884 



10/22 



cj 



CO 



cj 

o 
cj 



OO 



CJ 



CD > 



i — i cj in 

^ CJ 

co I— >~ 

u 
I— 

CD > 

U 
CD 

CD CD 



CO 



CD > 

+-> 
CD 

h- CJ 
O 

CJ CL 

CD > 

4-> 
CD 

a 

CD 

CD CD 

+-> 
CJ 

CJ cu 



CJ 

3 



CJ 
CJ) 

CD ^ 

CD > 
CD UJ 



t-H CD 
H IU Z 



zr: 



O 

CJ 
4-> 

CJ _ 
CD < 

-r-> 
O 

CD < 
CD UJ 



I — >- 
CD > 



lo 



CD 
CD 

< C£ 

CJ 
CJ 

CD < 

+J 
CJ 

CD < 
CD 

CJ __J 

O 

CJ 

o 

cj a. 
o 

o> 
I — 

CJ —I 

<J 

CJ 

CD <C 

CD > 

CD 
CD 



O 
CD 

I — O 
CD 

CD _ 

CJ 
CJ 



C\J 



CJ 
CJ 



C\J 



CO 



o 

CD 
4-> 



CJ 



CJ 
CD 

CD CD 
CD 



CD -M 



i-i CD 
C\J CD 

h fa q: 



oo 



GCc 


< 




cn — i 




cct 


Q_ 




4-> 
CJ 

<c 


i — 




GGg 


to 




AAG 




GCT 


< 




CAg 


o 




TAc 


>- 




ACc 


i — 




TGT 


> 




GCc 


< 


+-> 

o 

CD 


< 




TCC 


oo 




CTg 


-J 




CCc 


Q_ 




GTG 


> 




TGG 


12 


GGc 


CD 




GTg 


> 




TCc 


OO 




TCc 


OO 




ATG 






AAC 




GTg 


> 




CTg 


_J 




TGC 


CJ 




TGG 


12 




GAC 






GGc 


CD 


CTg 


— 1 




CJ 
t= 






AAc 


ZZL 




AAC 






GTG 


> 




GTG 


> 


p 


J 




GTc 


> 




TGC 


CJ 




ATG 


s: 




GTg 


> 




ATG 


21 


GAc 


Q 




TCT 


oo 




GAC 






ATG 


s: 




GCt 


< 




TCC 


OO 


GTg 


> 


1 — 1 


GGc 


CD 


1 — 1 

\ — 1 
rH 


CAG 




1~H 

CO 

rH 


ATG 


2: 


1 — i 

uo 

r-H 


CAg 


O 


1—1 


TAc 


>- 


CAT 




,-h 

Csi 


TGT 


CJ 


t-H CD 
CO h- 
CO CD 


> 


i — I 

cn 

CO 


GAc 


0 


H 
LO 


CCc 


O- 


r-H O 
r-H <C 
LO P 


>- 


cn 
CD 
03 


cc 




CTg 


1 




ACt 


h— 




TGG 






ATt 


1— • 




GCC 


<C 


CD 
CD 
03 






GAc 


Q 




GAG 


LU 




GCc 


< 




aGG 


C*L 




CTg 


1 


ATc 


\ — 1 




GGc 


CD 




TAT 


>- 




ATG 






CTg 


_J 




GGC 


CD 


ACc 






GTG 


> 




CD 
CD 
03 


C£L 




aGg 


on 




cTg 


_J 




GCt 


< 


ACc 


h- 




TAt 


>- 




aGg 


cn 




CAC 


IT- 




CAG 


O 




CTG 




o 




ATG 






CCc 


CL. 




GGc 


CD 




TCc 


00 




GTg 


> 


CCC 


cu 




GCc 


< 




TCc 


00 




TCt 


OO 




GTc 


> 




a 

CD 
CD 


CD 


ATC 


1— 1 




TCt 


OO 




DLL 


Lt_ 




GTc 


> 




GTG 


> 




TGG 


12 


CJ 

u 


oo 


r-H 

CO 


TGC 


CJ 


i— l 

o 

r-H 


ACC 


I— 


r-H 
C\J 
r-H 


CAt 


31 


r-H 

r— 1 

^ 


cTg 


_J 


t-H 
LO 
r-H 

. 


CAC 


IC 


tcC 


oo 


r-H 

CvJ 


CTg 


— 1 


r-H 

CD 
CO 


TTC 


Li_ 


r-H 
CD 
CO 


GGC 


CD 


r-H 

C\J 

^" 


GCC 


<c 


1 — 1 
OO 


GCC 


<c 



CD 



03 
03 



CJ 
CD 

CD CD 



' — • CD O 
»-h CD 

l — 

LO CD > 

O 
CD 

CD CD 

4-> 
CJ 

CD <C 



O . 

cn 
f— 

o . 

S: 

CD 
I — 

CD : 

r-H CD 
OO h- 
r-H U 

r-H CD 
h- 

LO CD 



WO 97/47358 



PCT/US97/09884 



11/22 



cj 

CJ 
CJ 

h- oo 

cj 
cj 

CD < 

CD > 

CJ) 
CD 

rO CC 
CD 

CD _ 
CJ 

o 

CJ 

t-H O O 
t-H 

^ c_> 

t-H CJ 

CO +-> oo 
u 

CJ _ 
CD < 



u 

CD 

CD CD 

CD > 

O 



CJ 
CJ 

Sx 

CD 
CD 



O 
CJ 

4-> CO 

CJ 
CD 

CD CD 



t 

CJ 
CJ 

< 



CJ — I 

o 

CJ _ 
CD < 



CJ 

CD <C 



5- 

o 

CJ 

CJ Q_ 

cn 

CD _ 
ro 



O 
CD 



CD : 
cn cj . 



CJ 

tu. 
u 

CD 

CD CD 
CJ 

lo < ^ 

t— I CJ 



oo 

CD > 
CJ 



CJ o 

CJ 
CJ 

CD <£ 

CJ 
CD 

CD CD 



CJ 
CJ 



00 



CD 



CJ 
CD 

— CJ 



CJ -J 

CJ 
CJ 

CD <C 



CJ 

CJ O- 



CJ 
CJ 
<C I 

CD 



CJ 
CJ 

t— I +J CO 

^ o 

kO t Ll 



o 

<C 1 — 

CD 
H CD 
^ < 



CJ 



O 

+-> OO 
CJ 

CJ _ 
CD < 

< s: 

CD ^ 

to ct: 



CD 
< 
CJ < 

CJ 

<3. 

CJ 



CJ 

a 
+-> 

CD 
CD 

i 

CJ 
CJ 



oo 



00 



CJ 
CD 



CD 



«— » CJ 
C\J I — oo 



1 • < 

cn cj 

CnJ < 



CO 
CO 



CJ 

CJ 

CJ 
CJ 
CD 

-M 
CJ 
CJ 

CD 



u 

CD 

CD CD 



CD 

h- CJ 

03 
CJ 

CJ Ql. 



1= 



an 

CD 

ro C£ 

CJ 

< 

CD Q 



CO 



4-> 
CJ 

< h- 
CD 

o 

<C I — 

o 

CD 

CD CD 

CD > 

CD 

h- 

CD > 
CD > 



CJ 
CJ 

<c 

CJ 



cn cj 

CO CJ Qu 



CJ -J 

CD 
1 — 

CJ _J 

r? 

CJ _l 
CD > 

cd Q 

LO CJ 
H < (- 

CD 
LO < 
t CD LU 



CD 



CJ 
CJ 

O 

1= u. 

a 

CD 

CD CD 



CJ 
<C 

CJ 
CJ 



OO 



TGC 


o 


ATC 


t — i 


CAa 




CJ 
CJ 

+-> oo 


GGc 


CSS 


ccc 


Cl. 


CCc 


Q_ 


ro 
CJ 

CJ Qu 


TCT 




GGc 


CD 


GCc 


<C 


CJ 

<C 1 — 


TCc 


to 


TGG 




TAt 


>- 


CJ 


AAC 




GGc 


CD 


CAC 


_n 


CJ 
CD 

r— CJ 


TTC 


Ll_ 


CAg 


O 


TGG 


3: 


£>. 


AAG 




GCc 


<c 


TGC 


CJ 


U 
1 — 

CD > 


AAG 






u_ 


TAC 


>- 


4-> 
CJ 

CJ Cl 


O) 




CD 




r— ♦ ro 




t-H CJ 



CJ 

CD 

CD 
CD 



t-H CJ 

t-H ej 
u 

CD 



CJ 

cS Q 

CJ 
CD 

CD CD 

CD 
CD 



C\J 



LO CD 

t-H 

00 < 

t-H 1 — >- 



00 



CM 



S3 



CH t-H 



Q CO 



CJ 
CJ 

CD 

3 



a- t-H CD CD »-h 



CJ 
CJ 

< h- 

CJ 
CJ 

CJ Q_ 

1? 

CD 



^ I — 



Cd 



CO CD 
CO I — CJ 



t-H O 

eg CD 

^ CD CD 



CJ 

CJ 
CD 

CD CD 



CD 
CD 



CJ 

$ z 

CJ 
CD 

CD CD 
CD 

5a 

CJ 
CJ 

CJ CL. 
t-H <J 

VO CJ 
i — I CJ Q_ 

r-H CD 

00 CD 
■si- ro Cd 



WO 97/47358 



PCT/US97/09884 



12/22 





CJ 








cj 










CL 






CJ 


















CJ 






cj 








CJ 








<: 


h- 






CD 
















o 


—I 






o 








CJ 








<C 

cj 


h- 






< 

<L 


2: 






CJ 








< 








CJ 








CD 








CD 


CD 




1 — I 


+-> 




T 1 




CJ 




rH 


i— 1 


CD 


«^ 


C\J 


r— » 


U 




r-H 




CD 




CO 


lO 


CD 


CD 


CO 




CJ 








CD 










f n 
























»— t 






















CJ 


















CJ 














CJ 








CJ 


CL. 






CJ 








CJ 








CJ 


CL. 






CJ 








CD 








CD 


CD 






(J 








CD 








CD 


CD 




i— i 


+J 




r-4 


QO 






o 


r-H 




CJ 


C\J 








\ 


t— 1 


CJ 




r-H 




CJ 




O 


LO 


< 


h- 


CO 



CD 
CD 



ro 
CJ 

CJ CL. 

U 
CD 

CD CD 



+-> 
CJ 

I — CO 

CJ 
CD 

CD CD 



CJ 

I. 

CJ 
CJ 

CJ 

£>- 

CJ 

3 1- 

CJ 
CJ 

CD < 
CD LU 
CJ CL. 

5i 

CD 

CD _ 
CJ 

t=u_ 

CJ 
CD 

I — CJ 



CJ 
CJ 
CJ 

CD CJ 



nj 

CJ Cu 
CJ 

CJ 

< >- 



CJ 

< 

CJ 

CD 
CD 



r-H CD 

CO CD aL 
C\J < 



cn 

CO 



CJ 

< 



LO 



ra 
CJ 

£>- 

Cl3 Q 

a> 

CD > 

g 

< s: 

CJ 
CD 



GCt 


< 




CTg 


AAt 






GAG 


CTg 


i 




TCt 


AGG 


or 




AGG 


CAC 


rn 




GAc 


GAG 


UJ 




AGG 


GTG 


> 




GAC 


GGc 


CD 




GAG 


GGc 


CD 


r-H 

CM 


ATt 


GTG 


> 


r-H 

t—l 
CO 


AAC 


TAT 






TGc 


JG 






cn 

CD 



CJ 



o 



CD 

u 

CJ 
CJ 

r-H CJ 
C\J CJ 
CXJ <C 



OJ 



< 21 
CD 

O 
I— 
< 



t 

CJ 

CJ 
CJ 

< 



CO 



CJ . 

CJ 
<C I 

CJ 
CJ 

< I 

CJ 

t. 

CJ 
CJ 



CO 



cj 

CD 



CJ 



CJ 
CJ 



cr> 

C\J 



CO 



CL. 
CJ _l 

U r-H 
I r-H 

<C < — " CO 

CD t-H 
<£ CO 
UOCTi 



CD 

I— 

CJ — t 

o 
cn 

< 

CJ Or 

CD > 

H-> 
< 

CD O 
CD > 



CD 
I — 

CJ —I 
CD 

CJ —J 

CJ _J 
CD 

4-> 
<C 

I — >- 

CD 
< 

CD LU 

LD 
CD 

I — 3: 
cn 



<C CO 

CJ or co 



fO QC 
LU 

a 

CD 

CD CD 

CD 
CD 

CJ 
CJ 

< 1 — 

CD 
CD 



CD 
CD 



CD 

CD LU 

H-> 
CJ 

< I— 

u 

CJ 

<C I — 

o 

CJ 



CO 



CJ — I 
CJ 

CJ —J 

CJ 
CD 

CD CD 



CJ 
CD 



CJ 
CJ 



CO 



CD > 

CD > 
O 

CD <£ 



CJ 

i — 



CD 
CO 



CJ 



CNJ 



r-H CJ 

CO CD 

CM 1 — 

S o 

CO CJ 

r-. CD 



CJ 



ao 

(XI 



CO 



CD 
I — 
CJ 

CJ . 

CJ 

U r-H 

CJ o 
CJ CL. CO 

* — 

O rH 
U 



CJ 

< 
o 

CJ 



CO 



f2 

CJ 

CJ 
CJ 
CD 

CJ 



<£ CO 



4-> 

CJ 

\— CO 

CJ 
CD 

CD CD 

cn 

CD > 

O 
CD 

CD CD 



ra 
ra 



CJ 
CJ 

CD < 



CD Q 



CJ 

CD < 
CD 

CJ _J 

CJ — I 



+JOOCDUaCDl- > 



^J- ,CD 
CO h~ 
^ CJ 
r-H 

CNJ CJ 

St, 



WO 97/47358 



PCI7US97/09884 



13/22 



o 



oo 



cj 
cj 



OO 



o 

CD 



cj 



CJ 

CD Q 



cj 

CD 
I — 

CD > 



CD > 

4-> 
CD 

I — O 

fO 
CJ 

CJ Cu 

cd 

CD > 

4-> 
CD 

o 

CD 

CD CD 

4-> 

CJ 

CJ CL. 



< 

i— I 

^ CJ 
CO I — >— 

CD > 

O 
CD 

CD CD 



a 
l — 

CD > 

s« 

CD 

CD _ 

CD > 
CD LU 



CO 



CJ 
CJ 

< 
< 

CJ 



cn 



I — 



in m 



J2 
a 

CJ 



4-> 
CJ 
CD 

4-> 

CJ 
CD 

s 



LO 



AAC 


2: 




GCc 


< 


AGG 






GCT 


< 


GCC 






GCt 


< 


GCt 


< 




GGc 


CD 


CTg 






GTg 


> 


ACc 


h- 




CTg 


_J 


CCC 


a_ 




£ 
a 


_J 


ACc 


H- 




GAc 


CD 


CTg 


— 1 


r— 4 


GTg 


> 


O 
CJ 




i — 1 

r— 1 


;at 





CJ -J 

CJ Or 

O 
CJ 



CO 



4-> 
CJ 

cj a. 

£>. 

cn 

cj _J 



cd : 

cn 
f— 

CJ 
CJ 

I— 

I— I 

CJ 

I — 

CD 



h- 00 
CJ 

t-h CD 
CT> CD CD 



CJ 
CJ 



O0 



3 

CJ 
CJ 

<c 
o 

CJ 
CJ 

CJ 
CJ 



CJ 
CD 



CJ 



CJ 
CD 

I — CJ 
CJ 

< 

CD C3 

CD i-H 
<C CO 

UOH 



CD <£ CM 
CD 

CD > 

CD 
CD 



CJ 
CD 

h- CJ 

CD 
CD 

ra ct: 

CJ 
CJ 



00 



CJ 

0 



00 



?£>- 

CD 
4-> 



CNJ ■ ^ 

^ CJ i-H 

H CD CNJ 

CO 4-J O 1— I 



3 



CD 



CD LU 

CD 
CD 

fO CC 



CJ JC 
CD 

CD 

CD _ 
CJ 

u 

CJ 

< h- 

u 

CJ 

< h- 

o 

CJ 

< h- 

CJ 
CJ 

CJ Q_ 
CJ 



CJ 
CJ 
4-> 

CJ 
U 



cd co 

CNJ I — CJ CO 

CJ _J 
CJ 



cj 

CD 

CD CD 
CD > 

CJ 

o 

CD < 

4-i 
o 



cd : 

4~> 
CJ 

i, 



h- >- 
CD 

CD _ 

ct: 

CD 

<o ct: 
cj 

CJ 

CJ CL. 



CO 



CJ 

o 



00 



00 



00 



CO 

1— I 
CO 

i-H 4-3 OO 



CJ 
CD 



CO I — CJ 1 — I 
.CD 



CNJ CJ 



CO 



CJ 

t= 

CJ 
CJ 



CNJ 



CO 
CO 



00 



CD 
CD 



C_> 

<C 

CD 
h- 
<C 

CD 

£ 

CD 
CD 



a 

CD 
CD 

CD : 

£5 

CD : 
CD 

5: 

CJ 

<t 

CD i 

cd : 
CD : 



CD 



LO 
21 r-l 



LO 



CJ 

CD < 

CD t-H 

5 a ^ 

CJ »— I 

CJ .-H 

CJ Cl LO 



CD 

u 
o 

CD <C 
CD 

CJ 

a 

CD 

CD CD 

CD 
I— 

CD r> 

CJ 
CJ 



CJ 

CJ _ 
CD < 

CD 

CD 
CD 

CJ 
<C 

CJ IC 

CJ 
CD 

CD CD 

4~> 

CJ 

h- 

CD > 

4-> 
<£ 
O 

O 
CD 

CD CD 



<C 1-4 

CD 
CD 

fO Cd 

CJ —I 

CJ —J 
CD 

6a 

CJ 

o 



00 



^ I — 



o 
h- 

CD : 

CD 

h- 

cd : 

cn 
a 



00 



CJ 
CJ 

CD <C 

CD 
I — 

CJ _J 

CJ 
CD 

CD CD 

4-> 
CJ 

CD < 

CJ — I 

CD 
I— 

CD > 

a 

CD 

CD CD 

CD 
CD 



CO 



CJ 
CJ 

CD <£ 



CJ 
< 

cj in 

CJ 
CJ 

CD < 



WO 97/47358 



PCTYUS97/09884 



14/22 



cn 



GGc 


v — J 




AAg 






AAt 






TCT 


OO 




GGc 


CD 




o 

CJ 

CD < 




CJ 

<.- 


GTG 


> 




CAG 


Or 




TGC 


cj 




TCc 


OO 




TGG 






TAt 


>- 




cj 


TCt 


00 




GCc 


<C 




AAt 






AAC 






GGc 


CD 




CAC 


zn 




o 

CD 

I— CJ 


GTc 


> 




TCt 


oo 




CTG 


1 




o 

1= 






CAg 






TGG 


3: 




U 

£^ 


TAt 






GGC 


CD 




GCC 


< 




AAG 






GCc 


< 




TGC 


CJ 




u 
1 — 

CD > 


ACC 






CCT 


Cl. 




ACT 






AAG 








Lt- 




TAC 


>- 




4-> 
CJ 

CJ CL 


ACc 


h- 




tec 


oo 




AGG 






GTg 


> 




AGG 






CCa 






O 
CD 

CD CD 


GGc 


CD 




DLL 


u_ 




AAC 






TAt 


>- 




GAC 






AGG 


C£ 




4-> 
CD 

1 — CJ 


GAt 




i — l 
i— » 

CsJ 


TTC 


u_ 


H 

CO 
CM 


ATC 


i — i 


r-H 
LO 
CsJ 


TTC 




T— 1 

CM 


ATT 


r— t 


r-H 

cn 

C\J 


CAg 


r-H 

r-H 
O CO 


u 



CO 
CO 



LO 



CD : 
o 

CD 

CD CD 



CJ _ 
CD < 



CO 
CO 



CJ 

o 



cn 

00 CO 



CJ 
CJ 
CD 

CD : 



< 

CJ 

CD 
CD 



r-H CD 
LO h- 

cj — J 
u 

CJ 

CD < 



r-H CJ 
r-H CJ 

CO CJ 



r-H CJ r-H 

r-* <C co 
CO CD Q cn 





III 


U. 




cn 

CD 

<0 




CTg 


_J 




CD 
CD 

CO 




CTg 


— i 




ACC 




ATG 






ACC 




GTG 


> 




CAG 




£ 






tcC 


r-H 

00 

r-H 


cTG 




r-H 

CD 
CM 


GCc 


\ 
r-H 

LO 


GTg 


> 


r-H 
CD 

CD 


CAT 



CJ 
CJ 

4-> OO 

CJ 
CD 

CD CD 



CJ 
CJ 

<C r— 



CJ 

CD <£ 



t Lu 

U 
CD 

CD CD 



CD _ 

CJ 
CD 

I— O 



u 

U 
CJ 
CD 



4-> 
CJ 



00 



00 



CJ 
CJ 

4-> OO 

cn 

CD _ 

CJ 
CJ 



OO 



CO Ctl 













o 






CD 


CD 




GT 






s 












cn 












4-> 


















O 


OO 




CJ 


—J 






i — i 




h- 




r-H 






r-H 


CJ 




r-H 


CJ 




CM 






^J" 


CJ 




LO 




< 


CM 


3 


Or CM 


h- 


oo 


CM 






^ 


















r-H 


CJ 




r-H 


CD 




r-H 


CJ 










CM 






CO 


CD 


in 


CO 




i — i 






UJ 




CD 



00 



h- CJ 



CO 
CM 



1 



CJ 

CD < 



< 
CJ 

CJ 
CJ 
< 

CJ 

CJ 
CJ 
CJ 



3 

CJ 
CJ _J 

(J 

CJ _ 
CD < 

•M 
CJ 

CJ Q_ 
CD > 

S~ 

CJ 
CD 

CD CD 

I— 

CD 

I — CJ 

ro 
CJ 

cj a_ 



cn 



u 
CD 

CD CD 

CD 
CD 



CJ 



U 
CJ 

< I — 

cj 

CJ 

CJ CL 

cn 

CD > 

CJ 
CD 

CD CD 
cn 

CD _ 



CD 
CD 
h- 

CJ 



CJ 
CD 

CD CD 

cn 

oo 
u 

CJ 

CJ Q_ 

a 

CJ 

CJ Qu 

CD 
CD 



lo a 

CO CJ 



LO 



CJ 



CJ h- 

o 

CD CD 
CD 

CD 

CD 

CD > 



CJ 
CJ 



CJ 
CJ 

r? 

CJ I 

CJ — I 

cn 

CJ _J 
CD > 



CJ 

< h- 
CD LU 



r-H 




r-H 


CJ 






CD 




CM 


CJ 




co 


•— i CO 


CJ 


Or CO 


-M 


OO 


r— i 


i — l 


u 


r-H 


ro 




CM 


CD 


CJ 


CO 


CJ 




o 


q_ cn 


CJ 


cl. cn 


CJ 


CL. 


r-H 



cn 

CD 

LL 



< 

CD 



WO 97/47358 



15/22 



PCT/US97/09884 



Mc 




cct 




GAC 




TGc 


o 


CAt 


zn 


GTg 


> 


CCa 


Q. 


AAG 




ATG 




CCC 


Q_ 


CD 
CD 
ro 


Of 


TGC 




GGC 


CD 


DLL 


Ll_ 


AGG 


CC 


GGc 


CD 


TGC 


CJ 


CCc 


CL 


TGt 


CJ 


GAc 


CD 


ACc 


1— 


ACc 


(— 


ACt 


h" 


cTG 


_J 



CD 
CD 
<C 

CJ 

6 
I 

CJ 

1 — 
I — 

CJ 

Sc 

CJ 

u 



CO 



CT» CJ 
CO O 
\ CJ 



CD 
CD 



CO 



CJ 

1= 



ACC 


r-H 
I — «-< 


TGC 


CO 
C\J 
CJ r-H 


CCa 


en 

C\J 
Q_ «-H 


ACc 


f— 


Oil 




ACC 


h- 


GGc 


CD 


TGC 


CJ 


GGc 


CD 


cTG 


— 1 


TCt 


GO 


CCa 


G_ 


ACt 




ACC 


H— 


GGC 


CD 


TAC 


D- 


tcC 


00 


AAC 


zz: 


TGt 


CJ 


CAC 


=c 


AAC 


^ 


AAC 


zz. 


AAG 




TGG 


3 


ATG 




GGC 


CD 


ACC 




CTg 


_j 


TGG 


IS 


GCt 


< 


TAC 


>- 


AGg 


ai 


ACc 


r— 


GGc 


CD 


ACc 


1 — 

1 


TAC 


>- 



GAG 


UJ 


ACt 


h- 


ATC 


t—i 


GGc 


CD 


ACc 


h- 


cTG 


— 1 


CD 
CD 
ro 


C£ 


TCc 


00 


GGc 


CD 


ACc 


h- 


CTG 


1 


ACT 


1 — 


TGG 


3: 


CTg 


1 


TCC 


OO 


AAC 




CTG 


1 


CTG 


1 


TGC 


CJ 


CCc 


Q_ 


0 

CJ 
CD 


<C 


GCc 


< 


CJ 

0 
+-> 


(X) 


cct 


Q_ 



0 
I — 

CD > 

CD 
I— 

CD > 

-M 
CJ 

CD <£ 

4-> 

O 

h- GO 

CJ 
CD 

CD CD 

CD 
I — 

CD > 

CJ 
CD 

CD CD 



I — >- 



LO 



LO 

CO 



4-> 

CJ 
CD 



2: «-H 



LU 



CD 
I — 
CJ 

CD 
CJ 

1 

CD 
H- 
CD 



CJ 
CD 

CD CD 

O 
CD 

CD CD 
CD Z> 



CD 
I — 
CJ 

CD LU 

+-> 
CJ 



00 



LO CJ 
CO CD 

\ h- 0 

1— I 

00 CJ 
CD CD 
1 — 1 CD CD 



GO O 
CO CD 
^ CD CD 

r-H 

r-H <£ r-4 



CJ CJ 

cj 

^ CD <£ 

r-H 

CD CD 
CnJ <C 
r-H CD UJ 



CNJ 



co 

CnJ 



O 

CJ Q_ 

CJ 
< 



LO 



CD 

a 

CD 

CJ 
< 
CJ I 

1, 

CD 

< 

CJ 
CD 



CD 
I — 

CJ . 

CJ 

<i I — 

CJ 

<c I — 
o 

l=U- 

CJ 
CJ 

I— 00 

o 

CD 

1— CJ 

ro 
CJ 

CJ Q_ 

CJ —J 
CJ 

£~ 

cj 0 



lo 



CJ 

r-H 

CO CJ 



ro 
ro 

4-) 4c 
U 

CD < 

CD O 
4-) 

O _ 

CD < 

CD 

CJ — 1 

CD 
I — 

CJ — I 

CJ —I 
CJ 

t=u. 

CD 
h- 

CJ _ 

cr> cd 



CO 



LO <C 

r-H h" 


LO 
>- i-H 


I— 

CJ 


I 


CAq 


or 


CTG 


1 


GTG 


> 


GTg 


> 


GAt 




TAt 




GTG 


> 


GAG 


LU 


ATt 

n 1 L 


1 — 1 


TGG 


zs. 


AAf 




f 




CAG 


O 


ATC 


1 — 1 






GTG 


> 



H- >- ^ h- CJ 



CO CD 
^t- CD 



CD 
LO 



r-H h- >- I" I 



eg cd 



CO CD 
CO CD 

r-H ro a: 



^ CD 



CD 

\— 

CJ 

r-H 

CD 4-> 
LO <C 

r-H CJ 



CnJ 
LO 



1= 



LO CJ 
LO CJ 



OO 



WO 97/47358 



PCT/US97/09884 



16/22 



TCC 


Ll- 


TGc 


O 


CD 

i— cj 


ACC 




GAC 


O 


TCc 


00 


ro 

o 

O Q- 


AAg 




ACT 


h- 


TTC 


Li- 


CJ 
CD 

\— cj 


CCc 




cTG 




311 


Li- 


o 

1 — 


GGc 


CD 


GTG 


> 


CCa 


Q_ 


cj 

CJ 

< r— 


GTg 


> 


ACt 


r- 


GAC 


O 


CD 


ATt 


i — i 


TGC 


cj 


GGc 


CD 




AGG 


DC 


ATc 


i— < 


cct 


Q_ 


cn 
h- 

CD > 


CD 
h- 
<C 




CD 
CD 




1 u 




O 

r-H CD 


r-H O 





CJ 
CD 

I — o 

fO 
CJ 

CJ O- 

cj 

CD 

CD CD 

+-> 
O 

< h- 

cj- 
cj 

< I — 
cj 

£ >- 

u 

CJ 

CD < 



CO CD O 



CO 



CD 
CD 



CD 

CJ — I 
CD 

CD _ 



U 
CJ 

I — 00 

CJ 
CD 

CD CD 

4~> 
CJ 



ro ai 
cj 

CJ 

CJ £X 
CJ — I 

£^ 

CJ 

cn 
h- 

CJ — J 

CD 
CD 



1 — «3 



oo 



Co 

-r-> 

ro ; 



CJ 
CJ 

H<h 

C\l 

^ CD «— « 

r-H <T C\J 

CO <? ^ r-H 



cj 




CD 




CD 


CD 


CD 




CD 




ra 




CD 












CJ 




r— 




CD 




O 




CD 




CD 


CD 






1 


















>- 


a 




CD 




CD 




cn 




r-H CD 









r-H U 

C\j CD CD 



i-H O 
OJ < I 

CJ 
CJ 

CJ Q_ 

s*r- 

CD CD 



f2 

CD 



CD 
CD 



CJ <C 
CD 



CJ 
CD 

CD 

CD 
ra 

CD 
CD 
I — 

£ 

CJ 

CD 
CD 

CD C£ 
<C 

CJ 

CJ oo 



CJ o 



u 

CD 

CD CD 



S~ 

u 

r-H CJ 

CO CD <t 

r-H CJ 

CO CD 

r-H CD CD 



00 



< 

CJ 

CD 

CD 



U 
CJ 
< 

s 

o 
o 

4-> 

a 

CD 



CD 
h- 

CD > 
CJ 



o 

<C I — 
CJ 

CJ 
CD 

CD CD 



r-H +-> 

CO CJ 
H <C 

r-H CD 

CO CD 



LO 



CJ 

cj m 



CJ 

u 
o 

CJ 

o 

CJ 
CD 



CJ 



OO 



r-H O 

O CJ 

r-H h- 



OO 



1 — CJ 



r-H CO 
CD CJ 
CO CJ Q- 



CD Q 

U 
CD 

CD CD 
CD 

ra C£ 

U 
CJ 

< I— 

r-H CD 

eg | — 

r-H CD > 

r-H cn 

< 
CO CD 



CJ 

5x 




CTg 


i 


CD 
r— 

CJ — 1 




GGc 


CD 


CD 

CD ^ 
ro C£ 




GTg 


> 


CD > 




CAG 


O 


a 

CD 

CD CD 




311 




cS Q 




ACc 




CD > 




GTg 


> 


»■ 

cn 
< 

CD LU 




GAG 


LU 


TTC ACt 
F T 


511/171 


aGG GAT 


Q 
on 






CTg 


—i 


cn 
< 

CD LU 




CTg 


-j 


+-> 

CJ 

cj a_ 




CCc 




CJ 
CJ 

CD < 




AAg 




+-> 
CJ 

cj a. 




TGC 


CJ 


cn 
h- 

CD > 




GCc 


< 


CD 

5a 




cct 


Q_ 


CJ 
CD 

1 — CJ 




GCc 


< 


ro 
CJ 

cj a_ 


r-H 
CO 
r-H 


TAt 


>- 


CJ 
CD 

— o 


r-H CD 

OO CD 

^ < 





< 

CNI 

6 

LL 



WO 97/47358 



PCT/US97/09884 



17/22 



CD 
\— 

CJ _J 
CD > 

3.- 

CD > 



lo 



o 

CJ 

CD <C 

CD 
I— 

O — I 

CD 
CD 



cj 
u 

CJ 



lo 



cn 



CJ> 

o cx 

CD 

CD LU 

-M 
CJ 

CJ Q_ 
CD 



CD 
CD 

cj 

CJ> 

CD <c 

a 

CD UJ 

< — 1 -M 
t— I CJ) 
CM CD <C 



03 

CJ 
CJ 



CJ 
CD 



lo 



CO 

CM 



o 
I — 

J2 

o 

CJ 
CJ 

u 

4-> 



GO 



CD CO 
1 — CJ CD 

CJ 

CJ CL_ 

O —J 

CJ o 
u 

CJ 

h- 00 

u 

CD 

CD CD 

CT> 
r— 

CD > 

4-> 

CJ 

CJ O- 



CJ 



i—l CP r-H 

0O <C CD 
H U OOvl 

r-H CJ 



CJ 

I — 

<C >— • 

CJ 

CJ 

CO 
CJ 

CJ CL_ 
CD 

cS LU 
+-> 

CJ —J 

CJ 
CJ 



cn 
h- 

CJ 
CJ 



CJ 

o 

CD 



cn 
h- 

CJ -J 

CJ 

ci Q 

4-> 

CJ 

CD < 



O 

T~l CJ 

cn CD 
co 1 — 

cj 

CJ 



LO 



LO 



LO < 
CM CD 

H -M 
LO CJ 

cj 

CJ 
CJ 



h- 

cd : 

CD 



3 

CD 

5 

CJ 

i 

CD 

CD 
CD 



LO 



LO 



CJ 
CD 

CD 

+-) 
CJ 
I — 

CJ 
I — 
CD 



CD 
CD 



CJ 

CJ D_ 

cn 

CD _ 
cn 

CJ 

CD < 



cd : 
cd : 



3 



CD 
CD 



s 

CJ 
CJ 
CJ 

u 

CJ 
CJ 
CD 



t-H CD 

cn < 
a: cm cd uj 



CO 



CJ 



CJ 



r-H O 
r-H CJ 
CO < 

CJ 

I— 

CJ 



oo 



CD 



r-» CJ 

CO CJ O- 

«-H CJ 

CO CJ 

cn cj cl. 



CJ 

cj a. 
u 

CJ 

cj a_ 

CD > 

£>- 

CJ 

CD Q 

a 

CJ 

cj a> 

r-H CJ 
CO < 

CO CD O 



t-h CD 

cn ^ 



LO 



CD O <£ 



CJ 
CJ 
CD 

CJ 

CJ 
CJ 



LO 



ro 
CJ 
CJ 



CJ 

CJ Q_ 



< 

cj : 
a> 

CD 
CO i 

CJ 

s*. 

u 

CJ 

< I 

CJ 
CD 



CJ 
CJ 



LO 



o 

CO 



LO 

CJ 
CJ 

< I— 



r-H CJ 

OJ CD 
C\J CD CD 

r-H cn 

LO CD 

co < cc 



CJ 

a 

CJ 

< f— 

r-H U 

^- CJ 
cm CD < 

r^H CD 

CM <£ 

r-K < ^ 



o 

CD 
CD 

CJ 
CD 
CD 

CD 

CD 

CD 
<£ 
CJ 

CD 
CD 
fO 

r-H CD 

CO CD 
CM h- 

r^H CD 

CO h- 

r-s. cj 



4-i 

CJ 

CD < 

CD 
CD 

(O CC 
C7> 

CJ _l 

CJ 
CJ 

CJ Q_ 
CD LU 



CJ 
CJ 

O I — LO 



CJ 

t=u- 

CD 



2 



CD 



CD 



CJ 

CD O 



r-H CD 

00 f— 
3 CM CJ 

r-H CJ 

I — 

I CO < M 



CD 
CD 

<C OH 

CJ 
U 

+-> LO 

a> 

CD 

CD _ 

CJ -J 
CJ 

<: ~ 

CD 



CD 
CD 



CJ 
CJ 

f— LO 

CD LU 

CJ —J 
CD 

CJ — I 

u 
cj 

CJ CL 

ro 
CJ 

CJ Q_ 
CJ 



co 



LO 

o 

^ r-H 



CD 

1 

CD 



CD 
CD 

ro Cd 

CJ 
CJ 

CJ Q_ 

ro 
CJ 

CJ Cl_ 

CJ 
CJ 

CJ Q_ 

O 
I — 

a 

CJ 

CJ> CL. 

ro 
CJ 

cj a_ 

CJ 
CJ 

CD <C 

< s: 

CJ 

u 

CJ 

CJ CL. 

CJ 
CJ 

CJ Q_ 

CJ — I 

CJ 
CJ 

CJ Q_ 

CJ 
CD 



O 
CO 

t-H +J 
O CJ 

cn CD <C 



CM 
CO 



LO 

cn 



CJ 

J2 >- 

CJ 
CJ 

4-> LO 



0O O 
CO CD 
^ CD CD 

r-H 

CM 4-> 
HUI 



WO 97/47358 



PCT/US97/09884 



18/22 



ACC 




CAG 


o 


CTg 


_J 


GAG 


LU 


AAG 


^ 


GAC 




CCC 


O. 


GAG 


UJ 


ACc 




CCT 


o_ 


ccc 




tct 


CO 


GCC 


<c 


CCc 


Q- 


ATG 


^ 


GTc 




CTg 


_i 


GCC 


< 


TCC 


CO 


ACt 




GAG 


UJ 


ACG 




TCC 


CO 


TCc 


CO 


GCT 


< 


GCc 


<c 


TAC 


>~ 


TGG 


3= 


cTG 




ACt 


1 — 


TCc 


CO 


TCc 


CO 



co O 
\ CD <£ 



r-H CJ 

i — I I — CO 



CT> CJ 
CO CD 
\ CD CD 
i — i 

r-H U 

+-> CO 



CD 

<C 

CD UJ 



CO 
C\J 



CO 



C\J 



u 

CD 

CD CD 



TCc 


CO 


GAC 


Q 


GAt 


Q 


GTc 


> 


GTg 


> 


TCt 


CO 


ACt 


h- 


GCt 


< 


GGc 


CD 


TCC 


CO 


TCt 




AGg 


CXl 


GAg 


LU 


TCc 


c/i 


GAC 


O 


ACt 


h- 


GGc 


CD 


GGc 


CD 


CTG 


1 


TCt 


m 


GAt 


Q 


GTg 


> 


c_> 
u 

-M 




GAT 


O 



co cn 
CO h- 
CD 

i— I 

CO +-> 

CD CJ 

r-H < ! 



CO CJ 

CO CD 

^ CD CD 

r-H 

CJ 

r-H I 

r-H I— U_ 



^ h- CO 

r-H 

O ro 

CXI CJ 

r-H CJ Q- 



OJ 



o 

4-> CO 

cn 

CJ — I 
CJ 

CD Q 

+-> 

CJ 

CJ Q- 
O 

cS Q 

U 
CD 

CD CD 

4-> 

CJ 

CJ Cl. 

CD 
< 

CD UJ 

i 

U 
CD 
CD CD 



ro 
TO 

4-> * 

CJ 
CJ 

I — CO 

CJ 
CD 

I — CJ 

CJ 
CD 

I — CJ 

CJ 
CJ 

CD <£ 

cn 
I — 

CD > 

+J 
<C 

CD O 

CD 
<C 

CD UJ 



CJ 

+-> CO 



CO CD 
CXI <c 



CXI o 
CO CJ 

r-H CD < 



o 

CM 

6 



WO 97/47358 



PCT/US97/09884 



19/22 



CO 



CTG 


1 


ACc 




GAt 


cj 


CJJ 




E 




TCt 

1 L/ L 


CO 


AAG 




ACc 


1 — 


cct 




GCg 


< 


AAg 




CAC 


nz 


tcC 


on 


GCC 


<c 


GTg 


> 


AAG 




TCC 


CO 


ATC 


i — i 


GAG 




TAt 


>- 


CAt 




GTg 




O) 




CAC 


31 


GAG 


LU 


GTc 


> 


CTG 


_i 


ACt 




GCC 


<c 


AAC 




GCt 


< 


ATG 




AGg 


C£ 


TCC 


CO 


TCt 


CO 


GTg 




GCt 




AAC 




GAC 


Q 


GCc 




CAC 


3Z 


GCt 




TGt 


cj 


CAC 




Ml 


U_ 


AAG 




CCc 


Q_ 


AAG 








4-> 




cj 




O 




4~> 




rH CJ 





CJ 

cj 
o 



CL- i-h 

CO 



rH CD 



rH 

CO 



cj 
< I — 

CD : 



o 

CD 



CJ 




CD 






CD 




i — i 


f— 
CJ 


— i 






cn 












CT 


-J 


o 


i 




f 


o 




CJ 








CJ 




CJ 






8 


CD 


< 


r- 


CO 






CJ 










CD 


CD 




3 






CD 


CD 


CD 




7ZL 






+-> 




CJ 








CJ 




o 








< 


I— 


4-> 


CO 




CJ 


CD 




CD 






CJ 






1 — 






CD 


f2 




CJ 


— 1 




CD 


U 




CJ 






4-> 


CJ 




CJ 






CJ 


<C 


h- 


CJ 


CL. 




CD 


CJ 










4-> 












U 




>- 




2: 






a 




CJ 






CD 


CJ 








rH 


CD 


h- 


CO 




i— i 




ra 






CnJ 




\ 




rH CD 




^ CJ 




rH 


(J 






rH CJ 




C\J 


CJ 




2: 


lo cj 


a. 


» — 1 





rH CD 

t-h <r 
c\j < 

CD 

< 

CD 



O 

<C CT> CJ 



CJ 



CJ 

a_ t— t 4-> oo i— i 



u 

CJ 
CD 

§ 

CJ 
I — 

<c 

CJ 
CJ 

< 

CJ 

o 

< 

CJ 

CO CJ 

o 



CD > 

CJ -J 

CD 
CD 

CJ 
CJ> 

CD <C 



a 

CJ 

< 

CJ 
CJ 



CO 



CJ 

I— 

CD 
CD 

h- 

CD 
4-> 



CnJ 



CO 
CO 



CJ 
CJ 



CO 



rH (J 

cn cj 
co < 



lo 



00 



rH I CO t-H 



CD 

CD > 

CD Q 

CD 
CD 

a3 a: 

CJ 
< 

3 = 

CJ 

< 

CD O 



CD 
h- 
CJ 

CD 



CJ> 

f2. 

CJ 
CJ 
CD 

s, 

CD 



CJ> 
I — 
CD 



CO 



CJ . 
CJ 



CD 
CD 

cn 



CD Q 

o 

CJ 

CD < 

CJ 
CD 

CD CD 



cn 



CJ 

cj a. 
cn 

CD 

rH CJ rH 

LO CD r^. 

rH CD CD rH 

rH CJ r-l 
LO CD rH 
CD CD LO 

cn 



I — >- 

l2? 

CJ _J 

CJ 
CJ 

CD <£ 
CD 

h- 
< 



CD 

< 

CD LU 



CJ 

< I— 

CJ 

CD O 

CD LU 
CD 

CJ — I 

CJ — I 

CD Q 

I- 



CO CJ . 

rH cn 
i— 

CNJ CJ 



CD < 



rH CD 

CM CD 



4J 
CJ 

CJ Q- 

CJ cr 

cn 
I — 

CD > 

h- 
CD 

h- CJ 
CJ 

t=u. 

u 
f— 

CD > 

CD rH 
< LO 
CD LU rH 



CD 

f— 

CD 
I — 
CD 



CJ 



cn 

CD 

ra ad 
CD > 

a 

CD 

CD CD 

r? 

CJ _J 
CD LU 



CJ 

CJ CL. 



rH CJ 
O CD 
CO CD CD 



rH CJ 
LO 1 — 
CO CD 



3$ 



CO 



CJ 



WO 97/47358 



20/22 



PCTYUS97/09884 



GTg 






ACC 


i— 




CAg 






TAc 


>- 




GCC 


<c 




4-> 

CJ _ 
CD <C 




GTg 






GAG 


LU 


CD 
CD 
fO 






TGC 


CJ 




TAC 


>- 




CTg 


—1 




CD 
CD 
(O 


cc 




h- 

CJ 

l — co 




CTg 


1 




ACt 


h- 


CAG 


o 




TAc 


>- 




ATc 


• — i 




CD 
CD 
ro 


C£ 




TGC 


o 




CJ 
CJ 

CD <£ 




GAC 


Q 




CJ 


U_ 


GGc 


CD 




O 

o 

CD <C 




TCc 


LO 




GAG 


UJ 




CD 
CD 










GAt 


Q 




GTc 


> 


CCT 


Q- 




TTt 


Lu 




GAG 


LU 




ACt 


f— 




cn 
CD 
ra 


oz. 




CD 

U 1 




GGc 


CD 




CD 
CD 
ra 




TCc 


lo 




GGC 


CD 




GAG 


LU 




CTg 


—1 




TAc 


>- 




CJ 

£>. 




TGt 


CJ 




CTg 




TAC 


>- 




ATG 


s: 




GTg 


> 




TCc 


LO 




GGc 


CD 




o 

CD 

1- CJ 




GTc 


> 




CJ 

u 

-M 


LO 


CAG 


o* 




CCc 


Q_ 




cn 
CD 






AGG 


Cd 




TGt 


CJ 




a 

CJ 

< h- 




CTg 


-J 




GCc 


< 


CJ 




r-H 

CXI 


AAC 




i — \ 

CO 
C\J 


ATC 


1 — 1 


H 
LO 
CM 


ATc 


t—i 


r-H 

CXI 


AAC 




i— i 

CTi 
CM 


CD 

CJ -J 


i — 1 
< — 1 

CO 


ATG 




T—i 

CO 
CO 


GCt 


< 



o 

CD 



CO 



LO CD CD UD 



f 



LO 
1^ 



CD 



O 
CJ 



CO 



o 

CJ 



LO 



CO 



LO 



CJ 

£</> 

CJ 
CD 

CD CD 



CJ 
CD 

CJ 

CJ 
CJ 
CJ 

cn 
1 — 

CJ 



CD 
CD 



CJ 

CD <C 



CD 
I — 
CD 

CJ 



4- 1 CO 
CD LU 



CD 
I — 

CD > 
+-> 



CJ 

cj 
< 

CD Q 



Q_ CXI 



CD 
MJ 



t= 

CD 



LU 



i—i CJ 
CXI CD 

cxi h- cj 
r—H cn 

VO CD 
CO fO 



to 


> 


r-H 

oo 


6 


CD- 


CAG 


cr 




a 

CD 
CD 


CD 


AGg 






AAg 




<_> 




TCc 


LO 


GAG 


UJ 




AAc 




cct 


Q- 




a 

<t l — 


GCt 


< 




CTG 


-J 


U 






CCC 


Q_ 


GAC 


Q 




GGC 


CD 


GT 




UD CD 





oo 



CJ 
CJ 

<C 

CJ 



CO 
CP. 



CD 



+J 

CD 



Si- 
a 

<c i — 



CJ 
CD 

I — CJ 

cd a 

§* 

CJ — I 
CD 



(J 
CJ 

CD < 



CT» 



3 
I 

s 

CJ 
CJ 

<c 

CJ 

S 

CJ 
CD 



LU 



CD 



CD 

3 

CD 
i — 
CJ 



CD 
< 

CD LU 

+-> 
CJ 

cj a. 

CJ CD* 

CJ 
CJ 

cj a. 

r— < 

LO 4-> 

CO CJ 
x O Q- 

T-H 

LO CJ 

S CD O 

CJ 
CD 

CD CD 

+-> 
CJ 

CJ Q- 

U 
CJ 

cj a_ 

CJ 
CJ 

CD < 



LO 



a 

4-> LO 



CXI 



H U 

(XI CD 
I s *- 1 — CJ 



CJ CXI CD CD 

g 

<C i — i 



CO 





CD 














CT 


_l 




CD <C 


CD LU 












cn 


I— 












CD 


CD 






CD 






ra CC 


r— CJ 
















1 — 1 


1—1 


O 




r-H 


u 


r-H CJ 




CO 






O 


CD 


CXI V~ 


co 


eg 


s 


CD 


CO 


V- CJ 


CO <£ m 
















r-H 


r-H 


■M 




r-H 


4-> 


r-H CD 


CXI 




U 




CD 


CJ 


h- 


CD 


CO 


+-> 


LO 


Cn CD <£ CT> CD > 


r-H 



CJ 

£>. 

CD 
CD 

< C£ 

a 

CJ 

< r- 

U 
CJ 

CD < 



WO 97/47358 



21/22 



PCT/US97/09884 



GTc 


> 


cn 

CD 


GCC 


<c 


f 




CAG 


at 


GAg 


UJ 


cn 

CD 

< C£ 


TGT 


CJ 


CD 
CD 
(O 




GCc 


< 


TGG 




GAg 


UJ 


CCc 


CL 


GGc 


CD 


CD 
CD 


ACc 




AAG 




ACt 




CTG 




CTg 


_J 


CTg 


1 


cct 


CL. 


CD > 


GCC 


< 


GGC 


CD 


GAg 


UJ 


ACC 


i — 


CAg 


CT 


GAC 


Q 


TCc 


OO 


CD 
CD 


GCt 


<: 


TCT 


oo 


TGG 


3: 


CCC 


Q- 


GAG 


UJ 


CTg 


1 


TAC 


>- 


CD 

U — 1 


AGG 




GCc 


< 


GCc 


< 


GCc 


< 


CAG 


CT 


CCc 


O. 


tec 


oo 


cj 
cj 

O Q- 


GGc 


CD 


GAt 


o 


GCt 


<: 


TAt 


>- 


GCc 


<£ 


GAg 


UJ 


CAC 


m 


U 
CJ 

CJ Q. 


GGc 


CD 


CAt 




AGG 




ATG 


21 


CTg 


_J 


ATT 


I— i 


CTg 


_i 


cn 

CD > 


CAG 


CT 



co cj 

^ CD < 

r-H 

t— ♦ cn 



CT> CJ 
CO CJ 
^ CD <£ 
1 — I 

r\ cn 



^r- I — 



cj 



CO CJ 
C\J I — 



co cn 
^J* I— 
^ cj . 
i — i 

CT> CJ 
CXJ I — 



LO CJ 

St 

t-H 

LO CJ 

CO < 



r-H 




i — 1 




1— 1 




i— 1 




CJ 




CJ 


r-H 


CJ 


CO 




CJ 




CD 


LO 


CJ 


LO 




1 — 


oo ^ 


CD CD 




1— 00 




, — i 




1 — 1 




1 — 1 




r-H 


T— 1 






cn 


CO 


CD 


Ch 




r 




CT 


LO 


CT 

L 


LO 



CJ 

CD <C 

+-> 
CJ 

CJ CL. 

CJ 
h- 

CJ 
CJ 

cj a. 

o 

C_J 

< I — 

CD 
h- 

CJ —J 

cn 
I — 

CJ — ! 



CD 
CD 
< 



CJ 

o 

t 

CJ 
CJ 



00 



oo 



LO 
CO 



CJ 
CJ 

I — oo 

CJ 
CD 

h- o 

CJ 
CJ 

I — oo 

CJ 
CJ 

<C I— 



u 

CJ 

CJ CL. 


AAC 




TCC 


on 


ACc 


1 — 


GCc 


< 


AAg 




cn 
I — 

CJ —J 


CJ 
CJ 

<C 1 — 


GGC 


CD 


TTC 


u_ 


GCC 


<c 


tct 


00 


AGG 






CJ 

Si- 


CTg 


_l 


DLL 


u_ 


GGc 


CD 


CTg 


_j 


CTg 


— 1 


a 

CJ 

CD < 


CJ 
CJ 

CJ Q- 


TGG 




CAC 


3: 


TAt 


>- 


GGc 


CD 


TGC 


CJ 


cn 

CD 


CJ 

< 

CD Q 


TCC 


OO 


ACC 


t— 


ATt 


i— 1 


CAt 


in 


TCc 


00 


cn 
h- 

CD > 


cn 

CD _ 


AAC 




ATG 


2: 


CAG 




CTg 


_j 


GCc 


< 


a 

4-> 00 


a 

CJ 

< 1 — 


GTg 


> 


CTG 


_i 


0 

CD 
h- 


C_> 


cn 
CD 


cc 


GTG 


> 


cn 

CD 

< 


1— 

CJ 1 


cct 


CL- 


ATc 


»—i 


GGc 


CD 


CAG 


CT 


AGG 




CJ 
CJ 

CD <C 



CD 
I— 
CD 



CJ 

CD <C 



OO CJ 
CO < 



I — >- 



CD 
<5fr 



00 C7> 
O I— 



5 y 



CT 
CxJ 



>- r-H 



CJ 
CJ 

< 

CJ 

6 



CNJ CD 

r-H 

LO CD 
CnJ CD 

r-H < 



CJ 



LO 



CNJ CJ 00 
CO CJ CO 
H CD < H 



CJ 
I — 

CJ 



00 



<: »— • r-H 



CJ 



CD CD 
LO CD 

^ 03 ct: 

r-H 

CD U 

LO < 

H CD X 



CD 
CD 



CJ 



CJ 

tu_ 

cn 
I— 

o — 1 

r-H 

CNJ CD 
LO < 
^ < ^ 

r-H 

LO O 

LO CD 

« — 1 CD CD 



WO 97/47358 



PCT/US97/09884 



22/22 



o 

<£ 

CD O 

(J 
CD 

CD CD 

CJ 
CD 

CD CD 

+-> 
U 



CO 



cn 
I— 

CD : 

h- 

CJ 
I — 

cj . 

cn 
h- 

cj . 

cx> 
l — 

cj . 

o> 
h- 

cj . 

CJ) 
CD 



CJ) 



CD 



CD 
CD 



LO f— 
LO CJ 
^ CD 



r-. CD 



LO 



LO 
LO 



CJ) 



CD 
CD 



CJ 
CD 

CD CD 



4-> 
CJ 



GO 



LO 



CD 
I — 
CJ 

CJ 

3 

CD 
h- 
CJ 

CD 
<C 
CJ 

CJ 
CJ 



uo 



TTC 


Ll. 


i — ) 


TGA 


* 


TGG 






3 




CD 
CD 
fO 


a: 




AAC 




CCC 


CL. 




CCC 


o_ 


CD 
CD 






CTg 


_J 


GCC 


< 




CTG 


— 1 


CD 
CD 
fO 






TAC 


>- 


TCc 


IS) 




ATC 


I — 1 


CTG 


_J 




GGC 


CD 


tcC 


in 




GTg 


> 



LO 



CJ 
CJ 
CD 



CO CJ 
LO <C 

HUI 



u 

CD 

CD CD 



a 

CO 

6 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/09884 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) : A61N 43/04; C12Q 1/68; C12N 15/00; C07H 21/02; A61K 39/00 
US CL ; 514/44; 435/6, 320.1; 536723.1; 434/184.1, 192.1 
According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 514/44; 435/6, 320.1; 536723.1; 434/184.1, 192.1 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
none 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
APS, STN, MEDLINE, BIOSIS, EMBASE, CAPLUS, WPIDS, SCISEARCH 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 

Category* Citation of document, with indication, where appropriate, of the relevant passages Relevant to claim No. 



X Selby et al. Expression, identification and subcellular 1-3 

localization of the proteins encoded by the hepatitis C viral 
genome. Journal of General Virology. 1993. Vol. 74, pages 
1 1 03-1 1 1 3, see entire document. 

X Bukh et al. Sequence analysis of the core gene of 14 1-3 

hepatitis C virus genotypes. Proc Natl. Acad. Sci. August 
1994. Vol. 91, pages 8239-8243, see entire document. 

Y Lathe. Synthetic Oligonucleotide Probes Deduced from Amino 1 -3 

Acid Sequence Data Theoretical and Practical Considerations. 
J. Mol. Biol. 1985. Vol. 183, pages 1-12, see entire 
document. 



fx] Further documents are listed in the continuation of Box C. Sec patent family annex. 



Special categories of cited document!: 

document defining the general state of the art which is not considered 
to be of particular relevance 

earlier document published on or after the international filing date 

document which may throw doubti oo priority clnim(s) or which is 
cited to esta bl ish the publication date of another citation or other 
■oecial reason (at specified) 



later document published after the international filing date or priority 
date and not in conflict with the application but ched to understand the 
principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
lata* 



document referring to an oral disclosure, use, exhibition or other 

document published prior to the intemational filing date but later than 
the priority date claimed 



document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 



document member of the i 



s patent family 



Date of the actual completion of the international search 
28 AUGUST 1997 


Date of mailing of the international search report 

1 1 SEP 1997 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized offi ^^2^fJ / \f<. 

ANDREW WANG 
Telephone No. (703) 308-0196 



Form PCT/ISA/210 (seeond sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/09884 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 


Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


Y 


Grantham et al. Codon catalog usage is a genome strategy 
modulated for gene expressivity. Nucleic Acids Research. 1981. 
Vol. 9, No. 1, pages r43-r74, see entire document. 


1-3 


A,P 


Ide et al. Characterization of the nuclear localization signal and 
subcellular distribution of hepatitis C virus nonstructural protein 
NS5A. Gene. December 1996. Vol. 182, pages 203-211, see 
entire document. 


1-3, 8-26 


X 


US 5,514,539 A (BUKH et al.) 07 May 1996, see entire 
document. 


1-3, 8-26 



Form PCT/ISA/210 (continuation of second sheet)(July 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/09884 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. Q Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. Claims Nos.: 4-7 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 

The enclosed copy of claims 4, 5 were not legible and claims 6, 7 depend on those claims. 

3. Q Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention Is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1- ["I As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 
claims. 

2 * 0 aU 8carchaDlc claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3 * CI on * y 80me °^ ^ e re ^ u ^ rcd additional search fees were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 



4- [^] No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| ] No protest accompanied the payment of additional search fees. 



Form PCT/1SA/210 (continuation of first sheet(l))(Ju!y 1992)* 



